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NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 
1. TECHNICAL FIELD 

5 

The present invention provides novel polynucleotides and proteins encoded by such 
polynucleotides, along with uses for these polynucleotides and proteins, for example in 
therapeutic, diagnostic and research methods. 

1 0 2. BACKGROUND OF THE INVENTION 

Technology aimed at the discovery of protein factors (including e.g., cytokines, such as 
lymphokines, interferons, CSFs, chemokines, and interleukins) a has matured rapidly over the past 
decade. The now routine hybridization cloning and expression cloning techniques clone novel 

1 5 polynucleotides "directly" in the sense that they rely on information directly related to the 
discovered protein (i.e., partial DNA/amino acid sequence of the protein in the case of 
hybridization cloning; activity of the protein in the case of expression cloning). More recent 
"indirect" cloning techniques such as signal sequence cloning, which isolates DNA sequences 
based on the presence of a now well-recognized secretory leader sequence motif, as well as 

20 various PCR-based or low stringency hybridization-based cloning techniques, have advanced the 
state of the art by making available large numbers of DNA/amino acid sequences for proteins that 
are known to have biological activity, for example, by virtue of their secreted nature in the case of 
leader sequence cloning, by virtue of their cell or tissue source in the case of PCR-based 
techniques, or by virtue of structural similarity to other genes of known biological activity. 

25 Identified polynucleotide and polypeptide sequences have numerous applications in, for 

example, diagnostics, forensics, gene mapping; identification of mutations responsible for genetic 
disorders or other traits, to assess biodiversity, and to produce many other types of data and 
products dependent on DNA and amino acid sequences. 

30 3. SUMMARY OF THE INVENTION 



1 
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The compositions of the present invention include novel isolated polypeptides, novel isolated 
polynucleotides encoding such polypeptides, including recombinant DNA molecules, cloned genes 
or degenerate variants thereof, especially naturally occurring variants such as allelic variants, 
antisense polynucleotide molecules, and antibodies that specifically recognize one or more epitopes 
5 present on such polypeptides, as well as hybridomas producing such antibodies. 

The compositions of the present invention additionally include vectors, including expression 
vectors, containing the polynucleotides of the invention, cells genetically engineered to contain such 
polynucleotides and cells genetically engineered to express such polynucleotides. 

The present invention relates to a collection or library of at least one novel nucleic acid 
1 0 sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by 

hybridization (SBH), and in some cases, sequences obtained from one or more public databases. The 
invention relates also to the proteins encoded by such polynucleotides, along with therapeutic, 
diagnostic and research utilities for these polynucleotides and proteins. These nucleic acid sequences 
are designated as SEQ ID NO: 1 - 1 1 and are provided in the Sequence Listing. In the nucleic acids 
1 5 provided in the Sequence Listing, A is adenine; C is cytosine; G is guanine; T is thymine; and N is 
any of the four bases. In the amino acids provided in the Sequence Listing, * corresponds to the stop 
codon. 

The nucleic acid sequences of the present invention also include, nucleic acid sequences that 
hybridize to the complement of SEQ ID NO: 1-11 under stringent hybridization conditions; nucleic 

20 acid sequences which are allelic variants or species homologues of any of the nucleic acid sequences 
recited above, or nucleic acid sequences that encode a peptide comprising a specific domain or 
truncation of the peptides encoded by SEQ ID NO: 1-11. A polynucleotide comprising a 
nucleotide sequence having at least 90% identity to an identifying sequence of SEQ ID NO: 1 - 1 1 or 
a degenerate variant or fragment thereof. The identifying sequence can be 100 base pairs in length. 

25 The nucleic acid sequences of the present invention also include the sequence information 

from the nucleic acid sequences of SEQ ID NO: 1 - 1 1 . The sequence information can be a segment 
of any one of SEQ ID NO: 1-11 that uniquely identifies or represents the sequence information of 
SEQ ID NO: 1-11. 

A collection as used in this application can be a collection of only one polynucleotide. The 
30 collection of sequence information or identifying information of each sequence can be provided on a 
nucleic acid array. In one embodiment, segments of sequence information are provided on a nucleic 
acid array to detect the polynucleotide that contains the segment. The array can be designed to detect 
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full-match or mismatch to the polynucleotide that contains the segment. The collection can also be 
provided in a computer-readable format. 

This invention also includes the reverse or direct complement of any of the nucleic acid 
sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and 
5 host cells or organisms transformed with these expression vectors. Nucleic acid sequences (or their 
reverse or direct complements) according to the invention have numerous applications in a variety of 
techniques known to those skilled in the art of molecular biology, such as use as hybridization 
probes, use as primers for PCR, use in an array, use in computer-readable media, use in sequencing 
full-length genes, use for chromosome and gene mapping, use in the recombinant production of 

10 protein, and use in the generation of anti-sense DNA or RNA, their chemical analogs and the like. 
In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1-1 1 or novel 
segments or parts of the nucleic acids of the invention are used as primers in expression assays that 
are well known in the art. In a particularly preferred embodiment, the nucleic acid sequences of SEQ 
ID NO: 1-1 1 or novel segments or parts of the nucleic acids provided herein are used in diagnostics 

1 5 for identifying expressed genes or, as well known in the art and exemplified by Vollrath et al, 

Science 258:52-59 (1992), as expressed sequence tags for physical mapping of the human genome. 

The isolated polynucleotides of the invention include, but are not limited to, a polynucleotide 
comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1-1 1 ; a polynucleotide 
comprising any of the full length protein coding sequences of SEQ ID NO: 1-11; and a 

20 polynucleotide comprising any of the nucleotide sequences of the mature protein coding sequences 
of SEQ ID NO: 1-11. The polynucleotides of the present invention also include, but are not limited 
to, a polynucleotide that hybridizes under stringent hybridization conditions to (a) the complement of 
any one of the nucleotide sequences set forth in SEQ ID NO: 1—11; (b) a nucleotide sequence 
encoding any one of the amino acid sequences set forth in the Sequence Listing; (c) a polynucleotide 

25 which is an allelic variant of any polynucleotides recited above; (d) a polynucleotide which encodes a 
species homolog (e.g. orthologs) of any of the proteins recited above; or (e) a polynucleotide that 
encodes a polypeptide comprising a specific domain or truncation of any of the polypeptides 
comprising an amino acid sequence set forth in the Sequence Listing. 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 

30 comprising any of the amino acid sequences set forth in the Sequence Listing; or the corresponding 
full length or mature protein. Polypeptides of the invention also include polypeptides with biological 
activity that are encoded by (a) any of the polynucleotides having a nucleotide sequence set forth in 
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SEQ ID NO: 1-1 1; or (b) polynucleotides that hybridize to the complement of the polynucleotides of 
(a) under stringent hybridization conditions. Biologically or immunologically active variants of any 
of the polypeptide sequences in the Sequence Listing, and "substantial equivalents" thereof (e.g., 
with at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% amino acid sequence 
5 identity) that preferably retain biological activity are also contemplated. The polypeptides of the 
invention may be wholly or partially chemically synthesized but are preferably produced by 
recombinant means using the genetically engineered cells (e.g. host cells) of the invention. 

The invention also provides compositions comprising a polypeptide of the invention. 
Polypeptide compositions of the invention may further comprise an acceptable carrier, such as a 
10 hydrophilic, e.g., pharmaceutically acceptable, carrier. 

The invention also provides host cells transformed or transfected with a polynucleotide of 
the invention. 

The invention also relates to methods for producing a polypeptide of the invention 
comprising growing a culture of the host cells of the invention in a suitable culture medium under 

1 5 conditions permitting expression of the desired polypeptide, and purifying the polypeptide from 
the culture or from the host cells. Preferred embodiments include those in which the protein 
produced by such process is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a variety of 
techniques known to those skilled in the art of molecular biology. These techniques include use 

20 as hybridization probes, use as oligomers, or primers, for PCR, use for chromosome and gene 
mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA 
or RNA, their chemical analogs and the like. For example, when the expression of an mRNA is 
largely restricted to a particular cell or tissue type, polynucleotides of the invention can be used as 
hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample 

25 using, e.g., in situ hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as expressed 
sequence tags for identifying expressed genes or, as well known in the art and exemplified by 
Vollrath et al, Science 258:52-59 (1992), as expressed sequence tags for physical mapping of the 
human genome. 

30 The polypeptides according to the invention can be used in a variety of conventional 

procedures and methods that are currently applied to other proteins. For example, a polypeptide 
of the invention can be used to generate an antibody that specifically binds the polypeptide. Such 
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antibodies, particularly monoclonal antibodies, are useful for detecting or quantitating the 
polypeptide in tissue. The polypeptides of the invention can also be used as molecular weight 
markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical condition 
5 which comprises the step of administering to a mammalian subject a therapeutically effective 
amount of a composition comprising a polypeptide of the present invention and a 
pharmaceutically acceptable carrier. 

In particular, the polypeptides and polynucleotides of the invention can be utilized, for 
example, in methods for the prevention and/or treatment of disorders involving aberrant protein 
1 0 expression or biological activity. 

The present invention further relates to methods for detecting the presence of the 
polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, be 
utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the 
identification of subjects exhibiting a predisposition to such conditions. The invention provides a 
15 method for detecting the polynucleotides of the invention in a sample, comprising contacting the 
sample with a compound that binds to and forms a complex with the polynucleotide of interest for 
a period sufficient to form the complex and under conditions sufficient to form a complex and 
detecting the complex such that if a complex is detected, the polynucleotide of interest is detected. 
The invention also provides a method for detecting the polypeptides of the invention in a sample 
20 comprising contacting the sample with a compound that binds to and forms a complex with the 
polypeptide under conditions and for a period sufficient to form the complex and detecting the 
formation of the complex such that if a complex is formed, the polypeptide is detected. 

The invention also provides kits comprising polynucleotide probes and/or monoclonal 
antibodies, and optionally quantitative standards, for carrying out methods of the invention. 
25 Furthermore, the invention provides methods for evaluating the efficacy of drugs, and monitoring 
the progress of patients, involved in clinical trials for the treatment of disorders as recited above. 

The invention also provides methods for the identification of compounds that modulate 
(i.e., increase or decrease) the expression or activity of the polynucleotides and/or polypeptides of 
the invention. Such methods can be utilized, for example, for the identification of compounds 
30 that can ameliorate symptoms of disorders as recited herein. Such methods can include, but are 
not limited to, assays for identifying compounds and other substances that interact with (e.g., bind 
to) the polypeptides of the invention. The invention provides a method for identifying a 
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compound that binds to the polypeptides of the invention comprising contacting the compound 

with a polypeptide of the invention in a cell for a time sufficient to form a polypeptide/compound 

complex, wherein the complex drives expression of a reporter gene sequence in the cell; and 

detecting the complex by detecting the reporter gene sequence expression such that if expression 

5 of the reporter gene is detected the compound that binds to a polypeptide of the invention is 

identified. 

The methods of the invention also provide methods for treatment which involve the 
administration of the polynucleotides or polypeptides of the invention to individuals exhibiting 
symptoms or tendencies. In addition, the invention encompasses methods for treating diseases or 
10 disorders as recited herein comprising administering compounds and other substances that 

modulate the overall activity of the target gene products. Compounds and other substances can 
effect such modulation either on the level of target gene/protein expression or target protein 
activity. 

The polypeptides of the present invention and the polynucleotides encoding them are also 
1 5 useful for the same functions known to one of skill in the art as the polypeptides and 

polynucleotides to which they have homology (set forth in Table 2); for which they have a 
signature region (as set forth in Table 3); or for which they have homology to a gene family (as 
set forth in Table 4). If no homology is set forth for a sequence, then the polypeptides and 
polynucleotides of the present invention are useful for a variety of applications, as described 
20 herein, including use in arrays for detection. 

4. DETAILED DESCRIPTION OF THE INVENTION 

4,1 DEFINITIONS 

25 It must be noted that as used herein and in the appended claims, the singular forms "a", 

"an" and "the" include plural references unless the context clearly dictates otherwise. 

The term "active" refers to those forms of the polypeptide which retain the biologic and/or 
immunologic activities of any naturally occurring polypeptide. According to the invention, the 
terms "biologically active" or "biological activity" refer to a protein or peptide having structural, 

30 regulatory or biochemical functions of a naturally occurring molecule. Likewise 

"immunologically active" or "immunological activity" refers to the capability of the natural, 
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recombinant or synthetic polypeptide to induce a specific immune response in appropriate animals 

or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application are those cells which are engaged in 

extracellular or intracellular membrane trafficking, including the export of secretory or enzymatic 

5 molecules as part of a normal or disease process. 

The terms "complementary" or "complementarity" refer to the natural binding of 

polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the 

complementary sequence 3'-TCA-5\ Complementarity between two single-stranded molecules 

may be "partial" such that only some of the nucleic acids bind or it may be "complete" such that 

10 total complementarity exists between the single stranded molecules. The degree of 

complementarity between the nucleic acid strands has significant effects on the efficiency and 
strength of the hybridization between the nucleic acid strands. 

The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 
differentiated cell types in an embryo or an adult, including the germ cells. The term "germ line 

15 stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a steady 
and continuous source of germ cells for the production of gametes. The term "primordial germ 
cells (PGCs)" refers to a small population of cells set aside from other cell lineages particularly 
from the yolk sac, mesenteries, or gonadal ridges during embryogenesis that have the potential to 
differentiate into germ cells and other cells. PGCs are the source from which GSCs and ES cells 

20 are derived. The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells 
not only populate the germ line and give rise to a plurality of terminally differentiated cells that 
comprise the adult specialized organs, but are able to regenerate themselves. 

The term "expression modulating fragment, " EMF, means a series of nucleotides which 
modulates the expression of an operably linked ORF or another EMF. 

25 As used herein, a sequence is said to "modulate the expression of an operably linked 

sequence" when the expression of the sequence is altered by the presence of the EMF. EMFs 
include, but are not limited to, promoters, and promoter modulating sequences (inducible 
elements). One class of EMFs is nucleic acid fragments which induce the expression of an 
operably linked ORF in response to a specific regulatory factor or physiological event. 

30 The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 

"oligonucleotide" are used interchangeably and refer to a heteropolymer of nucleotides or the 
sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or synthetic 
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origin which may be single-stranded or double-stranded and may represent the sense or the 

antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like material. In the 

sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and N is A, C, G or T 

(U). It is contemplated that where the polynucleotide is RNA, the T (thymine) in the sequences 

5 provided herein is substituted with U (uracil). Generally, nucleic acid segments provided by this 

invention may be assembled from fragments of the genome and short oligonucleotide linkers, or 

from a series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic 

acid which is capable of being expressed in a recombinant transcriptional unit comprising 

regulatory elements derived from a microbial or viral operon, or a eukaryotic gene. 

10 The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or 

"segment" or "probe" or "primer" are used interchangeably and refer to a sequence of nucleotide 
residues which are at least about 5 nucleotides, more preferably at least about 7 nucleotides, more 
preferably at least about 9 nucleotides, more preferably at least about 1 1 nucleotides and most 
preferably at least about 17 nucleotides. The fragment is preferably less than about 500 

1 5 nucleotides, preferably less than about 200 nucleotides, more preferably less than about 100 
nucleotides, more preferably less than about 50 nucleotides and most preferably less than 30 
nucleotides. Preferably the probe is from about 6 nucleotides to about 200 nucleotides, preferably 
from about 15 to about 50 nucleotides, more preferably from about 17 to 30 nucleotides and most 
preferably from about 20 to 25 nucleotides. Preferably the fragments can be used in polymerase 

20 chain reaction (PCR), various hybridization procedures or microarray procedures to identify or 
amplify identical or related parts of mRNA or DNA molecules. A fragment or segment may 
uniquely identify each polynucleotide sequence of the present invention. Preferably the fragment 
comprises a sequence substantially similar to any one of SEQ ID NOs: 1-1 1. 

Probes may, for example, be used to determine whether specific mRNA molecules are 

25 present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as 
described by Walsh et al. (Walsh, P,S. et al, 1992, PCR Methods Appl 1 :241-250). They may be 
labeled by nick translation, Klenow fill-in reaction, PCR, or other methods well known in the art. 
Probes of the present invention, their preparation and/or labeling are elaborated in Sambrook, J. et 
al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY; or 

30 Ausubel, F.M. et al., 1989, Current Protocols in Molecular Biology, John Wiley & Sons, New 
York NY, both of which are incorporated herein by reference in their entirety. 
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The nucleic acid sequences of the present invention also include the sequence information 

from the nucleic acid sequences of SEQ ID NOs: 1-11. The sequence information can be a 

segment of any one of SEQ ID NOs: 1-1 1 that uniquely identifies or represents the sequence 

information of that sequence of SEQ ID NO: 1-11. One such segment can be a twenty-mer 

5 nucleic acid sequence because the probability that a twenty-mer is fully matched in the human 

genome is 1 in 300. In the human genome, there are three billion base pairs in one set of 

chromosomes. Because 4 20 possible twenty-mers exist, there are 300 times more twenty-mers 

than there are base pairs in a set of human chromosomes. Using the same analysis, the probability 

for a seventeen-mer to be fully matched in the human genome is approximately 1 in 5. When 

1 0 these segments are used in arrays for expression studies, fifteen-mer segments can be used. The 

probability that the fifteen-mer is fully matched in the expressed sequences is also approximately 

one in five because expressed sequences comprise less than approximately 5% of the entire 

genome sequence. 

Similarly, when using sequence information for detecting a single mismatch, a segment can 
15 be a twenty-five mer. The probability that the twenty-five mer would appear in a human genome 
with a single mismatch is calculated by multiplying the probability for a full match (l^-4 25 ) times the 
increased probability for mismatch at each nucleotide position (3 x 25). The probability that an 
eighteen mer with a single mismatch can be detected in an array for expression studies is 
approximately one in five. The probability that a twenty-mer with a single mismatch can be detected 
20 in a human genome is approximately one in five. 

The term "open reading frame," ORF, means a series of nucleotide triplets coding for 
amino acids without any termination codons and is a sequence translatable into protein. 

The terms "operably linked" or "operably associated" refer to functionally related nucleic 
acid sequences. For example, a promoter is operably associated or operably linked with a coding 
25 sequence if the promoter controls the transcription of the coding sequence. While operably linked 
nucleic acid sequences can be contiguous and in the same reading frame, certain genetic elements 
e.g. repressor genes are not contiguously linked to the coding sequence but still control 
transcription/translation of the coding sequence. 

The term "pluripotent" refers to the capability of a cell to differentiate into a number of 
30 differentiated cell types that are present in an adult organism. A pluripotent cell is restricted in its 
differentiation capability in comparison to a totipotent cell. 
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The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an oligopeptide, 

peptide, polypeptide or protein sequence or fragment thereof and to naturally occurring or 

synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a stretch of amino acid 

residues of at least about 5 amino acids, preferably at least about 7 amino acids, more preferably 

5 at least about 9 amino acids and most preferably at least about 17 or more amino acids. The 

peptide preferably is not greater than about 200 amino acids, more preferably less than 150 amino 

acids and most preferably less than 100 amino acids. Preferably the peptide is from about 5 to 

about 200 amino acids. To be active, any polypeptide must have sufficient length to display 

biological and/or immunological activity. 

10 The term "naturally occurring polypeptide" refers to polypeptides produced by cells that 

have not been genetically engineered and specifically contemplates various polypeptides arising 
from post-translational modifications of the polypeptide including, but not limited to, acetylation, 
carboxylation, glycosylation, phosphorylation, lipidation and acylation. 

The term "translated protein coding portion" means a sequence which encodes for the full 

1 5 length protein which may include any leader sequence or any processing sequence. 

The term "mature protein coding sequence" means a sequence which encodes a peptide or 
protein without a signal or leader sequence. The "mature protein portion" means that portion of 
the protein which does not include a signal or leader sequence. The peptide may have been 
produced by processing in the cell which removes any leader/signal sequence. The mature protein 

20 portion may or may not include the initial methionine residue. The methionine residue may be 
removed from the protein during processing in the cell. The peptide may be produced 
synthetically or the protein may have been produced using a polynucleotide only encoding for the 
mature protein coding sequence. 

The term "derivative" refers to polypeptides chemically modified by such techniques as 

25 ubiquitination, labeling (e.g., with radionuclides or various enzymes), covalent polymer 
attachment such as pegylation (derivatization with polyethylene glycol) and insertion or 
substitution by chemical synthesis of amino acids such as ornithine, which do not normally occur 
in human proteins. 

The term "variant"(or "analog") refers to any polypeptide differing from naturally 
30 occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, e g., 
recombinant DNA techniques. Guidance in determining which amino acid residues may be 
replaced, added or deleted without abolishing activities of interest, may be found by comparing 
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the sequence of the particular polypeptide with that of homologous peptides and minimizing the 

number of amino acid sequence changes made in regions of high homology (conserved regions) 

or by replacing amino acids with consensus sequence. 

Alternatively, recombinant variants encoding these same or similar polypeptides may be 

5 synthesized or selected by making use of the "redundancy" in the genetic code. Various codon 

substitutions, such as the silent changes which produce various restriction sites, may be 

introduced to optimize cloning into a plasmid or viral vector or expression in a particular 

prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be reflected in 

the polypeptide or domains of other peptides added to the polypeptide to modify the properties of 

10 any part of the polypeptide, to change characteristics such as ligand-binding affinities, interchain 
affinities, or degradation/turnover rate. 

Preferably, amino acid "substitutions" are the result of replacing one amino acid with 
another amino acid having similar structural and/or chemical properties, i.e., conservative amino 
acid replacements. "Conservative" amino acid substitutions may be made on the basis of 

15 similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues involved. For example, nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar 
neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; and 

20 negatively charged (acidic) amino acids include aspartic acid and glutamic acid. "Insertions" or 
"deletions" are preferably in the range of about 1 to 20 amino acids, more preferably 1 to 10 
amino acids. The variation allowed may be experimentally determined by systematically making 
insertions, deletions, or substitutions of amino acids in a polypeptide molecule using recombinant 
DNA techniques and assaying the resulting recombinant variants for activity. 

25 Alternatively, where alteration of function is desired, insertions, deletions or 

non-conservative alterations can be engineered to produce altered polypeptides. Such alterations 
can, for example, alter one or more of the biological functions or biochemical characteristics of 
the polypeptides of the invention. For example, such alterations may change polypeptide 
characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover rate. 

30 Further, such alterations can be selected so as to generate polypeptides that are better suited for 
expression, scale up and the like in the host cells chosen for expression. For example, cysteine 
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residues can be deleted or substituted with another amino acid residue in order to eliminate 

disulfide bridges. 

The terms "purified" or "substantially purified" as used herein denotes that the indicated 
nucleic acid or polypeptide is present in the substantial absence of other biological 
5 macromolecules, e.g., polynucleotides, proteins, and the like. In one embodiment, the 

polynucleotide or polypeptide is purified such that it constitutes ai least 95% by weight, more 
preferably at least 99% by weight, of the indicated biological macromolecules present (but water, 
buffers, and other small molecules, especially molecules having a molecular weight of less than 
1000 daltons, can be present). 

10 The term "isolated" as used herein refers to a nucleic acid or polypeptide separated from at 

least one other component (e.g., nucleic acid or polypeptide) present with the nucleic acid or 
polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide is found in 
the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a 
solution of the same. The terms "isolated" and "purified" do not encompass nucleic acids or 

1 5 polypeptides present in their natural source. 

The term "recombinant," when used herein to refer to a polypeptide or protein, means that 
a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or mammalian) 
expression systems. "Microbial" refers to recombinant polypeptides or proteins made in bacterial 
or fungal (e.g., yeast) expression systems. As a product, "recombinant microbial" defines a 

20 polypeptide or protein essentially free of native endogenous substances and unaccompanied by 
associated native glycosylation. Polypeptides or proteins expressed in most bacterial cultures, 
e.g., E. colU will be free of glycosylation modifications; polypeptides or proteins expressed in 
yeast will have a glycosylation pattern in general different from those expressed in mammalian 
cells. 

25 The term "recombinant expression vehicle or vector" refers to a plasmid or phage or virus 

or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression vehicle can 
comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements 
having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural 
or coding sequence which is transcribed into mRNA and translated into protein, and (3) 

30 appropriate transcription initiation and termination sequences. Structural units intended for use in 
yeast or eukaryotic expression systems preferably include a leader sequence enabling extracellular 
secretion of translated protein by a host cell. Alternatively, where recombinant protein is 

12 
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expressed without a leader or transport sequence, it may include an amino terminal methionine 
residue. This residue may or may not be subsequently cleaved from the expressed recombinant 
protein to provide a final product. 

The term "recombinant expression system" means host cells which have stably integrated 
5 a recombinant transcriptional unit into chromosomal DNA or carry the recombinant 

transcriptional unit extrachromosomally. Recombinant expression systems as defined herein will 
express heterologous polypeptides or proteins upon induction of the regulatory elements linked to 
the DNA segment or synthetic gene to be expressed. This teim also means host cells which have 
stably integrated a recombinant genetic element or elements having a regulatory role in gene 

10 expression, for example, promoters or enhancers. Recombinant expression systems as defined 
herein will express polypeptides or proteins endogenous to the cell upon induction of the 
regulatory elements linked to the endogenous DNA segment or gene to be expressed. The cells 
can be prokaryotic or eukaryotic. 

The term "secreted" includes a protein that is transported across or through a membrane, 

15 including transport as a result of signal sequences in its amino acid sequence when it is expressed 
in a suitable host cell. "Secreted" proteins include without limitation proteins secreted wholly 
(e.g., soluble proteins) or partially (e.g. 9 receptors) from the cell in which they are expressed. 
"Secreted" proteins also include without limitation proteins that are transported across the 
membrane of the endoplasmic reticulum. "Secreted" proteins are also intended to include proteins 

20 containing non-typical signal sequences (e.g. Interleukin-1 Beta, see Krasney, P. A. and Young, 
P.R. (1992) Cytokine 4(2): 134 -143) and factors released from damaged cells (e.g. Interleukin-1 
Receptor Antagonist, see Arend, W.P. et. al. (1998) Annu. Rev. Immunol. 16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or leader 
sequence" which will direct the polypeptide through the membrane of a cell. Such a sequence 

25 may be naturally present on the polypeptides of the present invention or provided from 
heterologous protein sources by recombinant DNA techniques. 

The term "stringent" is used to refer to conditions that are commonly understood in the art 
as stringent. Stringent conditions can include highly stringent conditions (i.e., hybridization to 
filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, 

30 and washing in 0.1X SSC/0.1% SDS at 68°C), and moderately stringent conditions (i.e., washing 
in 0.2X SSC/0.1% SDS at 42°C). Other exemplary hybridization conditions are described herein 
in the examples. 
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In instances of hybridization of deoxyoligonucleo tides, additional exemplary stringent 

hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate at 37°C (for 
14-base oligonucleotides), 48°C (for 17-base oligos), 55°C (for 20-base oligonucleotides), and 
60°C (for 23-base oligonucleotides). 
5 As used herein, "substantially equivalent" or "substantially similar" can refer both to 

nucleotide and amino acid sequences, for example a mutant sequence, that varies from a reference 
sequence by one or more substitutions, deletions, or additions, the net effect of which does not 
result in an adverse functional dissimilarity between the reference and subject sequences. 
Typically, such a substantially equivalent sequence varies from one of those listed herein by no 

10 more than about 35% (i.e., the number of individual residue substitutions, additions, and/or 
deletions in a substantially equivalent sequence, as compared to the corresponding reference 
sequence, divided by the total number of residues in the substantially equivalent sequence is about 
0.35 or less). Such a sequence is said to have 65% sequence identity to the listed sequence. In 
one embodiment, a substantially equivalent, e.g., mutant, sequence of the invention varies from a 

15 listed sequence by no more than 30% (70% sequence identity); in a variation of this embodiment, 
by no more than 25% (75% sequence identity); and in a further variation of this embodiment, by 
no more than 20% (80% sequence identity) and in a further variation of this embodiment, by no 
more than 10% (90% sequence identity) and in a further variation of this embodiment, by no more 
that 5% (95% sequence identity). Substantially equivalent, e.g., mutant, amino acid sequences 

20 according to the invention preferably have at least 80% sequence identity with a listed amino acid 
sequence, more preferably at least 85% sequence identity, more preferably at least 90% sequence 
identity, more preferably at least 95% sequence identity, more preferably at least 98% sequence 
identity, and most preferably at least 99% sequence identity. Substantially equivalent nucleotide 
sequence of the invention can have lower percent sequence identities, taking into account, for 

25 example, the redundancy or degeneracy of the genetic code. Preferably, the nucleotide sequence 
has at least about 65% identity, more preferably at least about 75% identity, more preferably at 
least about 80% sequence identity, more preferably at least 85% sequence identity, more 
preferably at least 90% sequence identity, more preferably at least about 95% sequence identity, 
more preferably at least 98% sequence identity, and most preferably at least 99% sequence 

30 identity. For the purposes of the present invention, sequences having substantially equivalent 
biological activity and substantially equivalent expression characteristics are considered 
substantially equivalent. For the purposes of determining equivalence, truncation of the mature 
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sequence (e.g., via a mutation which creates a spurious stop codon) should be disregarded. 
Sequence identity may be determined, e.g., using the Jotun Hein method (Hein, J. (1990) Methods 
Enzymol. 183:626-645). Identity between sequences can also be determined by other methods 
known in the art, e.g. by varying hybridization conditions. 
5 The term "totipotent" refers to the capability of a cell to differentiate into all of the cell 

types of an adult organism. 

The term "transformation" means introducing DNA into a suitable host cell so that the 
DNA is replicable, either as an extrachromosomal element, or by chromosomal integration. The 
term "transfection" refers to the taking up of an expression vector by a suitable host cell, whether 

1 0 or not any coding sequences are in fact expressed. The term "infection" refers to the introduction 
of nucleic acids into a suitable host cell by use of a virus or viral vector. 

As used herein, an "uptake modulating fragment," UMF, means a series of nucleotides 
which mediate the uptake of a linked DNA fragment into a cell. UMFs can be readily identified 
using known UMFs as a target sequence or target motif with the computer-based systems 

15 described below. The presence and activity of a UMF can be confirmed by attaching the 

suspected UMF to a marker sequence. The resulting nucleic acid molecule is then incubated with 
an appropriate host under appropriate conditions and the uptake of the marker sequence is 
determined. As described above, a UMF will increase the frequency of uptake of a linked marker 
sequence. 

20 Each of the above terms is meant to encompass all that is described for each, unless the 

context dictates otherwise. 



4.2 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 

25 The isolated polynucleotides of the invention include a polynucleotide comprising the 

nucleotide sequences of SEQ ID NO: 1 - 1 1 ; a polynucleotide encoding any one of the peptide 
sequences of SEQ ID NO: 1 - 1 1; and a polynucleotide comprising the nucleotide sequence 
encoding the mature protein coding sequence of the polynucleotides of any one of SEQ ID NO: 1 
-11. The polynucleotides of the present invention also include, but are not limited to, a 

30 polynucleotide that hybridizes under stringent conditions to (a) the complement of any of the 
nucleotides sequences of SEQ ID NO: 1 - 1 1 ; (b) nucleotide sequences encoding any one of the 
amino acid sequences set forth in the Sequence Listing; (c) a polynucleotide which is an allelic 

15 
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variant of any polynucleotide recited above; (d) a polynucleotide which encodes a species 
homo log of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide 
comprising a specific domain or truncation of the polypeptides of SEQ ID NO: 1-11. Domains of 
interest may depend on the nature of the encoded polypeptide; e.g., domains in receptor-like 
5 polypeptides include ligand-binding, extracellular, transmembrane, or cytoplasmic domains, or 
combinations thereof; domains in immunoglobulin-like proteins include the variable 
immunoglobulin-like domains; domains in enzyme-like polypeptides include catalytic and 
substrate binding domains; and domains in ligand polypeptides include receptor-binding domains. 
The polynucleotides of the invention include naturally occurring or wholly or partially 
10 synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The polynucleotides may 
include the entire coding region of the cDNA or may represent a portion of the coding region of 
the cDNA. 

The present invention also provides genes corresponding to the cDNA sequences disclosed 
herein. The corresponding genes can be isolated in accordance with known methods using the 

1 5 sequence information disclosed herein. Such methods include the preparation of probes or primers 
from the disclosed sequence information for identification and/or amplification of genes in 
appropriate genomic libraries or other sources of genomic materials. Further 5' and 3 f sequence can 
be obtained using methods known in the art. For example, full length cDNA or genomic DNA that 
corresponds to any of the polynucleotides of SEQ ID NO: 1-11 can be obtained by screening 

20 appropriate cDNA or genomic DNA libraries under suitable hybridization conditions using any of 
the polynucleotides of SEQ ID NO: 1 - 1 1 or a portion thereof as a probe. Alternatively, the 
polynucleotides of SEQ ID NO: 1-11 may be used as the basis for suitable primer(s) that allow 
identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries. 

The nucleic acid sequences of the invention can be assembled from ESTs and sequences 

25 (including cDNA and genomic sequences) obtained from one or more public databases, such as 
dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence information, 
representative fragment or segment information, or novel segment information for the full-length 
gene. 

The polynucleotides of the invention also provide polynucleotides including nucleotide 
30 sequences that are substantially equivalent to the polynucleotides recited above. Polynucleotides 
according to the invention can have, e.g., at least about 65%, at least about 70%, at least about 
75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least about 85%, 86%, 87%, 
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88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, and even more typically at 

least about 95%, 96%, 97%, 98%, 99% sequence identity to a polynucleotide recited above. 

Included within the scope of the nucleic acid sequences of the invention are nucleic acid 

sequence fragments that hybridize under stringent conditions to any of the nucleotide sequences 
5 of SEQ ID NO: 1 - 1 1, or complements thereof, which fragment is greater than about 5 

nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and most 

preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, or 20 nucleotides or more that 

are selective for (i.e. specifically hybridize to) any one of the polynucleotides of the invention are 

contemplated. Probes capable of specifically hybridizing to a polynucleotide can differentiate 
10 polynucleotide sequences of the invention from other polynucleotide sequences in the same 

family of genes or can differentiate human genes from genes of other species, and are preferably 

based on unique nucleotide sequences. 

The sequences falling within the scope of the present invention are not limited to these 

specific sequences, but also include allelic and species variations thereof. Allelic and species 
1 5 variations can be routinely determined by comparing the sequence provided in SEQ ID NO: 1 - 1 1 , a 

representative fragment thereof, or a nucleotide sequence at least 90% identical, preferably 95% 

identical, to SEQ ID NOs: 1-11 with a sequence from another isolate of the same species. 

Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules 

coding for the same amino acid sequences as do the specific ORFs disclosed herein. In other words, 
20 in the coding region of an ORF, substitution of one codon for another codon that encodes the same 

amino acid is expressly contemplated. 

The nearest neighbor or homology result for the nucleic acids of the present invention, 

including SEQ ID NOs: 1 - 1 1, can be obtained by searching a database using an algorithm or a 

program. Preferably, a BLAST which stands for Basic Local Alignment Search Tool is used to 
25 search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 (1993) and Altschul S.F. 

et al. J. Mol. Biol. 21:403-410 (1990)). Alternatively a FASTA version 3 search against Genpept, 

using Fastxy algorithm. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are also 

provided by the present invention. Species homologs may be isolated and identified by making 
30 suitable probes or primers from the sequences provided herein and screening a suitable nucleic 

acid source from the desired species. 
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The invention also encompasses allelic variants of the disclosed polynucleotides or 
proteins; that is, naturally-occurring alternative forms of the isolated polynucleotides which also 
encode proteins which are identical, homologous or related to that encoded by the 
polynucleotides. 

5 The nucleic acid sequences of the invention are further directed to sequences which 

encode variants of the described nucleic acids. These amino acid sequence variants may be 
prepared by methods known in the art by introducing appropriate nucleotide changes into a native 
or variant polynucleotide. There are two variables in the construction of amino acid sequence 
variants: the location of the mutation and the nature of the mutation. Nucleic acids encoding the 

10 amino acid sequence variants are preferably constructed by mutating the polynucleotide to encode 
an amino acid sequence that does not occur in nature. These nucleic acid alterations can be made 
at sites that differ in the nucleic acids from different species (variable positions) or in highly 
conserved regions (constant regions). Sites at such locations will typically be modified in series, 
e.g., by substituting first with conservative choices (e.g., hydrophobic amino acid to a different 

1 5 hydrophobic amino acid) and then with more distant choices (e.g., hydrophobic amino acid to a 
charged amino acid), and then deletions or insertions may be made at the target site. Amino acid 
sequence deletions generally range from about 1 to 30 residues, preferably about 1 to 10 residues, 
and are typically contiguous. Amino acid insertions include amino- and/or carboxyl-terminal 
fusions ranging in length from one to one hundred or more residues, as well as intrasequence 

20 insertions of single or multiple amino acid residues. Intrasequence insertions may range generally 
from about 1 to 10 amino residues, preferably from 1 to 5 residues. Examples of terminal 
insertions include the heterologous signal sequences necessary for secretion or for intracellular 
targeting in different host cells and sequences such as FLAG or poly-histidine sequences useful 
for purifying the expressed protein. 

25 In a preferred method, polynucleotides encoding the novel amino acid sequences are 

changed via site-directed mutagenesis. This method uses oligonucleotide sequences to alter a 
polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent nucleotides 
on both sides of the changed amino acid to form a stable duplex on either side of the site of being 
changed. In general, the techniques of site-directed mutagenesis are well known to those of skill 

30 in the art and this technique is exemplified by publications such as, Edelman et al., DNA 2:183 
(1983). A versatile and efficient method for producing site-specific changes in a polynucleotide 
sequence was published by Zoller and Smith, Nucleic Acids Res. 10:6487-6500 (1982). PCR may 
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also be used to create amino acid sequence variants of the novel nucleic acids. When small 
amounts of template DNA are used as starting material, primer(s) that differs slightly in sequence 
from the corresponding region in the template DNA can generate the desired amino acid variant. 
PCR amplification results in a population of product DNA fragments that differ from the 
5 polynucleotide template encoding the polypeptide at the position specified by the primer. The 
product DNA fragments replace the corresponding region in the plasmid and this gives a 
polynucleotide encoding the desired amino acid variant. 

A further technique for generating amino acid variants is the cassette mutagenesis 
technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques well 
10 known in the art, such as, for example, the techniques in Sambrook et al., supra, and Current 
Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of the genetic 
code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be used in the practice of the invention for the cloning and expression 
of these novel nucleic acids. Such DNA sequences include those which are capable of 
15 hybridizing to the appropriate novel nucleic acid sequence under stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention can be used to 
generate polynucleotides encoding chimeric or fusion proteins comprising one or more domains 
of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any of the 
20 polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, amplified, or 
synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well known to 
those of skill in the art and can include, for example, methods for determining hybridization 
conditions that can routinely isolate polynucleotides of the desired sequence identities. 

In accordance with the invention, polynucleotide sequences comprising the mature protein 
25 coding sequences corresponding to any one of SEQ ID NO: 1-1 1, or functional equivalents 

thereof, may be used to generate recombinant DNA molecules that direct the expression of that 
nucleic acid, or a functional equivalent thereof, in appropriate host cells. Also included are the 
cDNA inserts of any of the clones identified herein. 

A polynucleotide according to the invention can be joined to any of a variety of other 
30 nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al. 
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). Useful 
nucleotide sequences for joining to polynucleotides include an assortment of vectors, e.g., 
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plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the 
art. Accordingly, the invention also provides a vector including a polynucleotide of the invention 
and a host cell containing the polynucleotide. In general, the vector contains an origin of 
replication functional in at least one organism, convenient restriction endonuclease sites, and a 
5 selectable marker for the host cell Vectors according to the invention include expression vectors, 
replication vectors, probe generation vectors, and sequencing vectors. A host cell according to the 
invention can be a prokaryotic or eukaryotic cell and can be a unicellular organism or part of a 
multicellular organism. 

The present invention further provides recombinant constructs comprising a nucleic acid 

10 having any of the nucleotide sequences of SEQ ID NOs: 1 - 1 1 or a fragment thereof or any other 
polynucleotides of the invention. In one embodiment, the recombinant constructs of the present 
invention comprise a vector, such as a plasmid or viral vector, into which a nucleic acid having 
any of the nucleotide sequences of SEQ ID NOs: 1 - 1 1 or a fragment thereof is inserted, in a 
forward or reverse orientation. In the case of a vector comprising one of the ORFs of the present 

15 invention, the vector may further comprise regulatory sequences, including for example, a 
promoter, operably linked to the ORF. Large numbers of suitable vectors and promoters are 
known to those of skill in the art and are commercially available for generating the recombinant 
constructs of the present invention. The following vectors are provided by way of example. 
Bacterial: pBs, phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, pNH16a, pNH18a, 

20 pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). 

Eukaryotic: pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an expression 
control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et aL, 

25 Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinantly. Many 
suitable expression control sequences are known in the art. General methods of expressing 
recombinant proteins are also known and are exemplified in R. Kaufinan, Methods in Enzymology 
185, 537-566 (1990). As defined herein "operably linked" means that the isolated polynucleotide 
of the invention and an expression control sequence are situated within a vector or cell in such a 

30 way that the protein is expressed by a host cell which has been transformed (transfected) with the 
ligated polynucleotide/expression control sequence. 
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Promoter regions can be selected from any desired gene using CAT (chloramphenicol 

transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 

pKK232-8 and pCM7. Particular named bacterial promoters include lad, lacZ, T3, T7, gpt, 

lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, 

5 early and late SV40, LTRs from retrovirus, and mouse metallothionein-L Selection of the 

appropriate vector and promoter is well within the level of ordinary skill in the art. Generally, 

recombinant expression vectors will include origins of replication and selectable markers 

permitting transformation of the host cell, e.g., the ampicillin resistance gene of E. coli and S. 

cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to direct 

1 0 transcription of a downstream structural sequence. Such promoters can be derived from operons 
encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), a-factor, acid 
phosphatase, or heat shock proteins, among others. The heterologous structural sequence is 
assembled in appropriate phase with translation initiation and termination sequences, and 
preferably, a leader sequence capable of directing secretion of translated protein into the 

1 5 periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a 
fusion protein including an amino terminal identification peptide imparting desired characteristics, 
e.g., stabilization or simplified purification of expressed recombinant product. Useful expression 
vectors for bacterial use are constructed by inserting a structural DNA sequence encoding a 
desired protein together with suitable translation initiation and termination signals in operable 

20 reading phase with a functional promoter. The vector will comprise one or more phenotypic 
selectable markers and an origin of replication to ensure maintenance of the vector and to, if 
desirable, provide amplification within the host. Suitable prokaryotic hosts for transformation 
include E. coli, Bacillus subtilis, Salmonella typhimurium and various species within the genera 
Pseudomonas, Streptomyces, and Staphylococcus, although others may also be employed as a 

25 matter of choice. 

As a representative but non-limiting example, useful expression vectors for bacterial use 
can comprise a selectable marker and bacterial origin of replication derived from commercially 
available plasmids comprising genetic elements of the well known cloning vector pBR322 
(ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine 

30 Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the structural 
sequence to be expressed. Following transformation of a suitable host strain and growth of the 
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host strain to an appropriate cell density, the selected promoter is induced or derepressed by 

appropriate means {e.g., temperature shift or chemical induction) and cells are cultured for an 

additional period. Cells are typically harvested by centrifiigation, disrupted by physical or 

chemical means, and the resulting crude extract retained for further purification. 

5 Polynucleotides of the invention can also be used to induce immune responses. For 

example, as described in Fan et al., Nat Biotech. 17:870-872 (1999), incorporated herein by 

reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies 

against the encoded polypeptide following topical administration of naked plasmid DNA or 

following injection, and preferably intra-muscular injection of the DNA. The nucleic acid 

10 sequences are preferably inserted in a recombinant expression vector and may be in the form of 

naked DNA. 

4.3 ANTISENSE 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that 

1 5 are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NO: 1 - 1 1, or fragments, analogs or derivatives thereof. An "antisense" 
nucleic acid comprises a nucleotide sequence that is complementary to a "sense" nucleic acid 
encoding a protein, e.g., complementary to the coding strand of a double-stranded cDNA 
molecule or complementary to an mRNA sequence. In specific aspects, antisense nucleic acid 

20 molecules are provided that comprise a sequence complementary to at least about 10, 25, 50, 100, 
250 or 500 nucleotides or an entire coding strand, or to only a portion thereof. Nucleic acid 
molecules encoding fragments, homologs, derivatives and analogs of a protein of any of SEQ ID 
NO: 1 - 1 1 or antisense nucleic acids complementary to a nucleic acid sequence of SEQ ID NO: 1 
- 11 are additionally provided. 

25 In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region' 1 of 

the coding strand of a nucleotide sequence of the invention. The term "coding region" refers to 
the region of the nucleotide sequence comprising codons which are translated into amino acid 
residues. In another embodiment, the antisense nucleic acid molecule is antisense to a "noncoding 
region" of the coding strand of a nucleotide sequence of the invention. The term "noncoding 

30 region" refers to 5* and 3' sequences that flank the coding region that are not translated into amino 
acids (Le. 9 also referred to as 5' and 3 1 untranslated regions). 
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Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g., SEQ ID 

NO: 1-11, antisense nucleic acids of the invention can be designed according to the rules of 

Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be 

complementary to the entire coding region of an mRNA, but more preferably is an 

5 oligonucleotide that is antisense to only a portion of the coding or noncoding region of an mRNA. 

For example, the antisense oligonucleotide can be complementary to the region surrounding the 

translation start site of an mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 

15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention 

can be constructed using chemical synthesis or enzymatic ligation reactions using procedures 

10 known in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can 
be chemically synthesized using naturally occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical stability of 
the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. 

1 5 Examples of modified nucleotides that can be used to generate the antisense nucleic acid 

include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 

4- acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 

5- carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 

20 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, S-methoxycarboxymethyluracil, 5-methoxyuracil, 

2- methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 

25 uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 

3- (3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 

30 described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
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genomic DNA encoding a protein according to the invention to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of 
an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in 
5 the major groove of the double helix. An example of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then administered 
systemically. For example, for systemic administration, antisense molecules can be modified 
such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g., 

10 by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface 
receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using 
the vectors described herein. To achieve sufficient intracellular concentrations of antisense 
molecules, vector constructs in which the antisense nucleic acid molecule is placed under the 
control of a strong pol II or pol III promoter are preferred. 

15 In yet another embodiment, the antisense nucleic acid molecule of the invention is an 

a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual a-units, the 
strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res 15: 6625-6641). The 
antisense nucleic acid molecule can also comprise a 2-o-methylribonucleotide (Inoue et al. (1987) 

20 Nucleic Acids Res 15: 6131-6148) or a chimeric RNA -DNA analogue (Inoue et al (1987) FEBS 
Lett 215: 327-330). 



4.4 RIBOZYMES AND PNA MOIETIES 

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. 

25 Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a 
single-stranded nucleic acid, such as an mRNA, to which they have a complementary region. 
Thus, ribozymes (e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988) 
Nature 334:585-591)) can be used to catalytically cleave mRNA transcripts to thereby inhibit 
translation of an mRNA. A ribozyme having specificity for a nucleic acid of the invention can be 

30 designed based upon the nucleotide sequence of a DNA disclosed herein (i.e., SEQ ID NO: 1 - 
1 1). For example, a derivative of Tetrahymena L-19 IVS RNA can be constructed in which the 
nucleotide sequence of the active site is complementary to the nucleotide sequence to be cleaved 

24 
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in a mRNA. See, e.g., Cech et al. U.S. Pat. No. 4,987,071 ; and Cech et al. U.S. Pat. No. 
5,1 16,742. Alternatively, mRNA of the invention can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel et al., (1993) 
Science 261:141 1-1418. 
5 Alternatively, gene expression can be inhibited by targeting nucleotide sequences 

complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple helical 
structures that prevent transcription of the gene in target cells. See generally, Helene. (1991) 
Anticancer Drug Des. 6: 569-84; Helene. etal (1992) Ann. N.Y. Acad. Sci. 660:27-36; andMaher 
(1992) Bioassays 14: 807-15. 

10 In various embodiments, the nucleic acids of the invention can be modified at the base 

moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or 
solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids 
can be modified to generate peptide nucleic acids (see Hyrup et al (1996) BioorgMed Chem 4: 
5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, 

1 5 e.g. , DNA mimics, in which the deoxyribose phosphate backbone is replaced by a pseudopeptide 
backbone and only the four natural nucleobases are retained. The neutral backbone of PNAs has 
been shown to allow for specific hybridization to DNA and RNA under conditions of low ionic 
strength. The synthesis of PNA oligomers can be performed using standard solid phase peptide 
synthesis protocols as described in Hyrup et al. (1996) above; Perry-O'Keefe et al. (1996) PNAS 

20 93: 14670-675. 

PNAs of the invention can be used in therapeutic and diagnostic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing transcription or translation arrest or inhibiting replication. 
PNAs of the invention can also be used, e.g., in the analysis of single base pair mutations in a 

25 gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in 
combination with other enzymes, e.g., SI nucleases (Hyrup B. (1996) above); or as probes or 
primers for DNA sequence and hybridization (Hyrup et al. (1996), above; Perry-O'Keefe (1996), 
above). 

In another embodiment, PNAs of the invention can be modified, e.g., to enhance their 
30 stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 

formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery 
known in the art. For example, PNA-DNA chimeras can be generated that may combine the 
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advantageous properties of PNA and DNA. Such chimeras allow DNA recognition enzymes, e.g., 

RNase H and DNA polymerases, to interact with the DNA portion while the PNA portion would 

provide high binding affinity and specificity. PNA-DNA chimeras can be linked using linkers of 

appropriate lengths selected in terms of base stacking, number of bonds between the nucleobases, 

5 and orientation (Hyrup (1996) above). The synthesis of PNA-DNA chimeras can be performed as 

described in Hyrup (1996) above and Finn et al. (1996) Nucl Acids Res 24: 3357-63. For 

example, a DNA chain can be synthesized on a solid support using standard phosphoramidite 

coupling chemistry, and modified nucleoside analogs, e.g., 

5 , -(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used between the PNA 
10 and the 5' end of DNA (Mag et al (1989) Nucl Acid Res 17: 5973-88). PNA monomers are then 
coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3 f 
DNA segment (Finn et al (1996) above). Alternatively, chimeric molecules can be synthesized 
with a 5* DNA segment and a 3' PNA segment. See, Petersen et al. (1975) Bioorg Med Chem Lett 
5: 1119-11124. 

15 In other embodiments, the oligonucleotide may include other appended groups such as 

peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the 
cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl Acad. Sci. U.S.A. 86:6553-6556; 
Lemaitre et al, 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. W088/09810) or 
the blood-brain barrier (see, e.g., PCT Publication No. W089/10134). In addition, 

20 oligonucleotides can be modified with hybridization triggered cleavage agents (See, e.g. , Krol et 
al, 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., Zon, 1988, Pharm. Res. 
5: 539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
cleavage agent, etc. 

25 

4.5 HOSTS 

The present invention further provides host cells genetically engineered to contain the 
polynucleotides of the invention. For example, such host cells may contain nucleic acids of the 
invention introduced into the host cell using known transformation, transfection or infection 
30 methods. The present invention still further provides host cells genetically engineered to express 
the polynucleotides of the invention, wherein such polynucleotides are in operative association 

26 
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with a regulatory sequence heterologous to the host cell which drives expression of the 

polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, or 

increase, expression of endogenous polypeptide. Cells can be modified (e.g., by homologous 

5 recombination) to provide increased polypeptide expression by replacing, in whole or in part, the 

naturally occurring promoter with all or part of a heterologous promoter so that the cells express 

the polypeptide at higher levels. The heterologous promoter is inserted in such a manner that it is 

operatively linked to the encoding sequences. See, for example, PCT International Publication 

No. WO94/12650, PCT International Publication No. WO92/20808, and PCT International 

1 0 Publication No. WO91/09955. It is also contemplated that, in addition to heterologous promoter 
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which encodes 
carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron DNA 
may be inserted along with the heterologous promoter DNA. If linked to the coding sequence, 
amplification of the marker DNA by standard selection methods results in co-amplification of the 

1 5 desired protein coding sequences in the cells. 

The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower 
eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the recombinant construct into the host cell can be effected by 
calcium phosphate transfection, DEAE, dextran mediated transfection, or electroporation (Davis, 

20 L. et al, Basic Methods in Molecular Biology (1986)). The host cells containing one of the 
polynucleotides of the invention, can be used in conventional manners to produce the gene 
product encoded by the isolated fragment (in the case of an ORF) or can be used to produce a 
heterologous protein under the control of the EMF. 

Any host/vector system can be used to express one or more of the ORFs of the present 

25 invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, Cv-1 cell, 
COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. coli and B. subtilis. The 
most preferred cells are those which do not normally express the particular polypeptide or protein 
or which expresses the polypeptide or protein at low natural level. Mature proteins can be 
expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate 

30 promoters. Cell-free translation systems can also be employed to produce such proteins using 
RNAs derived from the DNA constructs of the present invention. Appropriate cloning and 
expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et 
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al., in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New York 
(1989), the disclosure of which is hereby incorporated by reference. 

Various mammalian cell culture systems can also be employed to express recombinant 
protein. Examples of mammalian expression systems include the COS-7 lines of monkey kidney 
5 fibroblasts, described by Gluzman, Cell 23: 175 (1981). Other cell lines capable of expressing a 
compatible vector are, for example, the C127, monkey COS cells, Chinese Hamster Ovary (CHO) 
cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 cells, 3T3 cells, 
CV-1 cells, other transformed primate cell lines, normal diploid cells, cell strains derived from in 
vitro culture of primary tissue, primary explants, HeLa cells, mouse L cells, BHK, HL-60, U937, 

10 HaK or Jurkat cells. Mammalian expression vectors will comprise an origin of replication, a 
suitable promoter and also any necessary ribosome binding sites, polyadenylation site, splice 
donor and acceptor sites, transcriptional termination sequences, and 5' flanking nontranscribed 
sequences. DNA sequences derived from the SV40 viral genome, for example, SV40 origin, 
early promoter, enhancer, splice, and polyadenylation sites may be used to provide the required 

15 nontranscribed genetic elements. Recombinant polypeptides and proteins produced in bacterial 
culture are usually isolated by initial extraction from cell pellets, followed by one or more 
salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein refolding 
steps can be used, as necessary, in completing configuration of the mature protein. Finally, high 
performance liquid chromatography (HPLC) can be employed for final purification steps. 

20 Microbial cells employed in expression of proteins can be disrupted by any convenient method, 
including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents. 

Alternatively, it may be possible to produce the protein in lower eukaryotes such as yeast 
or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or any 

25 yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial strains 
include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial strain 
capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, it may be 
necessary to modify the protein produced therein, for example by phosphorylation or 
glycosylation of the appropriate sites, in order to obtain the functional protein. Such covalent 

30 attachments may be accomplished using known chemical or enzymatic methods. 

In another embodiment of the present invention, cells and tissues may be engineered to 
express an endogenous gene comprising the polynucleotides of the invention under the control of 
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inducible regulatory elements, in which case the regulatory sequences of the endogenous gene 
may be replaced by homologous recombination. As described herein, gene targeting can be used 
to replace a gene's existing regulatory region with a regulatory sequence isolated from a different 
gene or a novel regulatory sequence synthesized by genetic engineering methods. Such 
5 regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, 
negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or 
combinations of said sequences. Alternatively, sequences which affect the structure or stability of 
the RNA or protein produced may be replaced, removed, added, or otherwise modified by 
targeting. These sequence include polyadenylation signals, mRNA stability elements, splice sites, 

10 leader sequences for enhancing or modifying transport or secretion properties of the protein, or 
other sequences which alter or improve the function or stability of protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the gene 
under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or 
both upstream of a gene. Alternatively, the targeting event may be a simple deletion of a 

15 regulatory element, such as the deletion of a tissue-specific negative regulatory element. 

Alternatively, the targeting event may replace an existing element; for example, a tissue-specific 
enhancer can be replaced by an enhancer that has broader or different cell-type specificity than the 
naturally occurring elements. Here, the naturally occurring sequences are deleted and new 
sequences are added. In all cases, the identification of the targeting event may be facilitated by 

20 the use of one or more selectable marker genes that are contiguous with the targeting DNA, 

allowing for the selection of cells in which the exogenous DNA has integrated into the host cell 
genome. The identification of the targeting event may also be facilitated by the use of one or 
more marker genes exhibiting the property of negative selection, such that the negatively 
selectable marker is linked to the exogenous DNA, but configured such that the negatively 

25 selectable marker flanks the targeting sequence, and such that a correct homologous 

recombination event with sequences in the host cell genome does not result in the stable 
integration of the negatively selectable marker. Markers useful for this purpose include the 
Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyl-transferase (gpt) gene. 

30 The gene targeting or gene activation techniques which can be used in accordance with 

this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to 
Chappel; U.S. Patent No. 5,578,461 to Sherwin et aL; International Application No. 



WO 02/077180 PCT/US02/08964 
PCT/US92/09627 (WO93/09222) by Selden et al; and International Application No. 
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by reference 
herein in its entirety. 



5 4.6 POLYPEPTIDES OF THE INVENTION 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 
comprising: the amino acid sequences set forth as any one of SEQ ID NO: 1-1 1 or an amino acid 
sequence encoded by any one of the nucleotide sequences SEQ ED NOs: 1 - 1 1 or the 
corresponding full length or mature protein. Polypeptides of the invention also include 

10 polypeptides preferably with biological or immunological activity that are encoded by: (a) a 

polynucleotide having any one of the nucleotide sequences set forth in SEQ ID NOs: 1 - 1 1 or (b) 
polynucleotides encoding any one of the amino acid sequences set forth as SEQ ID NO: 1-11 or 
(c) polynucleotides that hybridize to the complement of the polynucleotides of either (a) or (b) 
under stringent hybridization conditions. The invention also provides biologically active or 

15 immunologically active variants of any of the amino acid sequences set forth as SEQ ID NO: 1-11 
or the corresponding full length or mature protein; and "substantial equivalents" thereof (e.g., 
with at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 
85%, 86%, 87%, 88%, 89%, at least about 90%, 91%, 92%, 93%, 94%, typically at least about 
95%, 96%, 97%, more typically at least about 98%, or most typically at least about 99% amino 

20 acid identity) that retain biological activity. Polypeptides encoded by allelic variants may have a 
similar, increased, or decreased activity compared to polypeptides comprising SEQ ID NO: 1-1 1. 

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the protein may 
be in linear form or they may be cyclized using known methods, for example, as described in H. 

25 U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. McDowell, et al., J. Amer. 
Chem. Soc. 1 14, 9245-9253 (1992), both of which are incorporated herein by reference. Such 
fragments may be fused to carrier molecules such as immunoglobulins for many purposes, 
including increasing the valency of protein binding sites. 

The present invention also provides both full-length and mature forms (for example, 

30 without a signal sequence or precursor sequence) of the disclosed proteins. The protein coding 
sequence is identified in the sequence listing by translation of the disclosed nucleotide sequences. 
The mature form of such protein may be obtained by expression of a full-length polynucleotide in 
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a suitable mammalian cell or other ho^cell. The sequence of the mature form of the protein is 

also determinable from the amino acid sequence of the full-length form. Where proteins of the 

present invention are membrane bound, soluble forms of the proteins are also provided. In such 

forms, part or all of the regions causing the proteins to be membrane bound are deleted so that the 

5 proteins are fully secreted from the cell in which they are expressed. 

Protein compositions of the present invention may further comprise an acceptable carrier, 

such as a hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 

The present invention further provides isolated polypeptides encoded by the nucleic acid 

fragments of the present invention or by degenerate variants of the nucleic acid fragments of the 

10 present invention. By "degenerate variant" is intended nucleotide fragments which differ from a 
nucleic acid fragment of the present invention (e.g., an ORF) by nucleotide sequence but, due to 
the degeneracy of the genetic code, encode an identical polypeptide sequence. Preferred nucleic 
acid fragments of the present invention are the ORFs that encode proteins. 

A variety of methodologies known in the art can be utilized to obtain any one of the 

1 5 isolated polypeptides or proteins of the present invention. At the simplest level, the amino acid 
sequence can be synthesized using commercially available peptide synthesizers. The 
synthetically-constructed protein sequences, by virtue of sharing primary, secondary or tertiary 
structural and/or conformational characteristics with proteins may possess biological properties in 
common therewith, including protein activity. This technique is particularly useful in producing 

20 small peptides and fragments of larger polypeptides. Fragments are useful, for example, in 

generating antibodies against the native polypeptide. Thus, they may be employed as biologically 
active or immunological substitutes for natural, purified proteins in screening of therapeutic 
compounds and in immunological processes for the development of antibodies. 

The polypeptides and proteins of the present invention can alternatively be purified from 

25 cells which have been altered to express the desired polypeptide or protein. As used herein, a cell 
is said to be altered to express a desired polypeptide or protein when the cell, through genetic 
manipulation, is made to produce a polypeptide or protein which it normally does not produce or 
which the cell normally produces at a lower level. One skilled in the art can readily adapt 
procedures for introducing and expressing either recombinant or synthetic sequences into 

30 eukaryotic or prokaryotic cells in order to generate a cell which produces one of the polypeptides 
or proteins of the present invention. 
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The invention also relates to methods for producing a polypeptide comprising growing a 
culture of host cells of the invention in a suitable culture medium, and purifying the protein from 
the cells or the culture in which the cells are grown. For example, the methods of the invention 
include a process for producing a polypeptide in which a host cell containing a suitable expression 
5 vector that includes a polynucleotide of the invention is cultured under conditions that allow 
expression of the encoded polypeptide. The polypeptide can be recovered from the culture, 
conveniently from the culture medium, or from a lysate prepared from the host cells and further 
purified. Preferred embodiments include those in which the protein produced by such process is a 
full length or mature form of the protein. 

10 In an alternative method, the polypeptide or protein is purified from bacterial cells which 

naturally produce the polypeptide or protein. One skilled in the art can readily follow known 
methods for isolating polypeptides and proteins in order to obtain one of the isolated polypeptides 
or proteins of the present invention. These include, but are not limited to, 
immunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, 

15 and immuno-affinity chromatography. See, e.g., Scopes, Protein Purification: Principles and 

Practice, Springer- Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory Manual; 
Ausubel et al., Current Protocols in Molecular Biology. Polypeptide fragments that retain 
biological/immunological activity include fragments comprising greater than about 100 amino 
acids, or greater than about 200 amino acids, and fragments that encode specific protein domains. 

20 The purified polypeptides can be used in in vitro binding assays which are well known in 

the art to identify molecules which bind to the polypeptides. These molecules include but are not 
limited to, for e.g., small molecules, molecules from combinatorial libraries, antibodies or other 
proteins. The molecules identified in the binding assay are then tested for antagonist or agonist 
activity in in vivo tissue culture or animal models that are well known in the art. In brief, the 

25 molecules are titrated into a plurality of cell cultures or animals and then tested for either 
cell/animal death or prolonged survival of the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the peptides 
may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to 
cells. The toxin-binding molecule complex is then targeted to a tumor or other cell by the 

30 specificity of the binding molecule for SEQ ID NO: 1-11. 
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The protein of the invention may also be expressed as a product of transgenic animals, 
e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized 
by somatic or germ cells containing a nucleotide sequence encoding the protein. 

The proteins provided herein also include proteins characterized by amino acid sequences 
5 similar to those of purified proteins but into which modification are naturally provided or 
deliberately engineered. For example, modifications, in the peptide or DNA sequence, can be 
made by those skilled in the art using known techniques. Modifications of interest in the protein 
sequences may include the alteration, substitution, replacement, insertion or deletion of a selected 
amino acid residue in the coding sequence. For example, one or more of the cysteine residues 

10 may be deleted or replaced with another amino acid to alter the conformation of the molecule. 
Techniques for such alteration, substitution, replacement, insertion or deletion are well known to 
those skilled in the art (see, e.g., U.S. Pat. No. 4,518,584). Preferably, such alteration, 
substitution, replacement, insertion or deletion retains the desired activity of the protein. Regions 
of the protein that are important for the protein function can be determined by various methods 

1 5 known in the art including the alanine-scanning method which involved systematic substitution of 
single or strings of amino acids with alanine, followed by testing the resulting alanine-containing 
variant for biological activity. This type of analysis determines the importance of the substituted 
amino acid(s) in biological activity. Regions of the protein that are important for protein function 
may be determined by the eMATRIX program. 

20 Other fragments and derivatives of the sequences of proteins which would be expected to 

retain protein activity in whole or in part and are useful for screening or other immunological 
methodologies may also be easily made by those skilled in the art given the disclosures herein. 
Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of the 

25 invention to suitable control sequences in one or more insect expression vectors, and employing 
an insect expression system. Materials and methods for baculovirus/insect cell expression 
systems are commercially available in kit form from, e.g., Invitrogen, San Diego, Calif., U.S.A. 
(the MaxBat™ kit), and such methods are well known in the art, as described in Summers and 
Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), incorporated herein by 

30 reference. As used herein, an insect cell capable of expressing a polynucleotide of the present 
invention is "transformed." 
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The protein of the invention may be prepared by culturing transformed host cells under 
culture conditions suitable to express the recombinant protein. The resulting expressed protein 
may then be purified from such culture (i.e., from culture medium or cell extracts) using known 
purification processes, such as gel filtration and ion exchange chromatography. The purification 
5 of the protein may also include an affinity column containing agents which will bind to the 
protein; one or more column steps over such affinity resins as concanavalin A-agarose, 
heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™; one or more steps involving 
hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl 
ether; or immunoaffinity chromatography. 

1 0 Alternatively, the protein of the invention may also be expressed in a form which will 

facilitate purification. For example, it may be expressed as a fusion protein, such as those of 
maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as a 
His tag. Kits for expression and purification of such fusion proteins are commercially available 
from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and Invitrogen, 

1 5 respectively. The protein can also be tagged with an epitope and subsequently purified by using a 
specific antibody directed to such epitope. One such epitope ("FLAG®") is commercially 
available from Kodak (New Haven, Conn.). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- HPLC) 
steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other 

20 aliphatic groups, can be employed to further purify the protein. Some or all of the foregoing 
purification steps, in various combinations, can also be employed to provide a substantially 
homogeneous isolated recombinant protein. The protein thus purified is substantially free of other 
mammalian proteins and is defined in accordance with the present invention as an "isolated 
protein." 

25 The polypeptides of the invention include analogs (variants). This embraces fragments, as 

well as peptides in which one or more amino acids has been deleted, inserted, or substituted. 
Also, analogs of the polypeptides of the invention embrace fusions of the polypeptides or 
modifications of the polypeptides of the invention, wherein the polypeptide or analog is fused to 
another moiety or moieties, e.g., targeting moiety or another therapeutic agent. Such analogs may 

30 exhibit improved properties such as activity and/or stability. Examples of moieties which may be 
fused to the polypeptide or an analog include, for example, targeting moieties which provide for 
the delivery of polypeptide to pancreatic cells, e.g., antibodies to pancreatic cells, antibodies to 
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immune cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well as receptor 
and ligands expressed on pancreatic or immune cells. Other moieties which may be fused to the 
polypeptide include therapeutic agents which are used for treatment, for example, 
immunosuppressive drugs such as cyclosporin, SK506, azathioprine, CD3 antibodies and steroids. 
5 Also, polypeptides may be fused to immune modulators, and other cytokines such as alpha or beta 
interferon. 



4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE IDENTITY 
AND SIMILARITY 

1 0 Preferred identity and/or similarity are designed to give the largest match between the 

sequences tested. Methods to determine identity and similarity are codified in computer programs 
including, but are not limited to, the GCG program package, including GAP (Devereux, J., et al., 
Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, University of Wisconsin, 
Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, S.F. et al., J. Molec. Biol. 

15 215:403-410 (1990), PSI-BLAST (Altschul S.F. et al., Nucleic Acids Res. vol. 25, pp. 3389-3402, 
herein incorporated by reference), eMatrix software (Wu et al., J. Comp. Biol., Vol. 6, pp. 219- 
235 (1999), herein incorporated by reference), eMotif software (Nevill-Manning et al, ISMB-97, 
Vol. 4, pp. 202-209, herein incorporated by reference), pFam software (Sonnhammer et al, 
Nucleic Acids Res., Vol. 26(1), pp. 320-322 (1998), herein incorporated by reference) and the 

20 Kyte-Doolittle hydrophobocity prediction algorithm (J. Mol Biol, 157, pp. 105-31 (1982), 

incorporated herein by reference). The BLAST programs are publicly available from the National 
Center for Biotechnology Information (NCBI) and other sources (BLAST Manual, Altschul, S., et 
al. NCB NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. Biol. 215:403-410 (1990). 

4.7 CHIMERIC AND FUSION PROTEINS 

25 The invention also provides chimeric or fusion proteins. As used herein, a "chimeric 

protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to another 
polypeptide. Within a fusion protein the polypeptide according to the invention can correspond to 
all or a portion of a protein according to the invention. In one embodiment, a fusion protein 
comprises at least one biologically active portion of a protein according to the invention. In 
30 another embodiment, a fusion protein comprises at least two biologically active portions of a 
protein according to the invention. Within the fusion protein, the term "operatively linked" is 
intended to indicate that the polypeptide according to the invention and the other polypeptide are 
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fused in-frame to each other. The polypeptide can be fused to the N-terminus or C-terminus, or to 

the middle. 

For example, in one embodiment a fusion protein comprises a polypeptide according to 
the invention operably linked to the extracellular domain of a second protein. 
5 In another embodiment, the fusion protein is a GST-fusion protein in which the 

polypeptide sequences of the invention are fused to the C-terminus of the GST (i.e., glutathione 
S-transferase) sequences. 

In another embodiment, the fusion protein is an immunoglobulin fusion protein in which 
the polypeptide sequences according to the invention comprise one or more domains fused to 

1 0 sequences derived from a member of the immunoglobulin protein family. The immunoglobulin 
fusion proteins of the invention can be incorporated into pharmaceutical compositions and 
administered to a subject to inhibit an interaction between a ligand and a protein of the invention 
on the surface of a cell, to thereby suppress signal transduction in vivo. The immunoglobulin 
fusion proteins can be used to affect the bioavailability of a cognate ligand. Inhibition of the 

15 ligand/protein interaction may be useful therapeutically for both the treatment of proliferative and 
differentiative disorders, e.g., cancer as well as modulating (e.g., promoting or inhibiting) cell 
survival. Moreover, the immunoglobulin fusion proteins of the invention can be used as 
immunogens to produce antibodies in a subject, to purify ligands, and in screening assays to 
identify molecules that inhibit the interaction of a polypeptide of the invention with a ligand. 

20 A chimeric or fusion protein of the invention can be produced by standard recombinant 

DNA techniques. For example, DNA fragments coding for the different polypeptide sequences 
are li gated together in-frame in accordance with conventional techniques, e.g., by employing 
blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to 

25 avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can be 
synthesized by conventional techniques including automated DNA synthesizers. Alternatively, 
PCR amplification of gene fragments can be carried out using anchor primers that give rise to 
complementary overhangs between two consecutive gene fragments that can subsequently be 
annealed and reamplified to generate a chimeric gene sequence (see, for example, Ausubel et al. 

30 (eds.) Current Protocols in Molecular Biology, John Wiley & Sons, 1992). Moreover, 
many expression vectors are commercially available that already encode a fusion moiety (e.g., a 
GST polypeptide). A nucleic acid encoding a polypeptide of the invention can be cloned into 
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such an expression vector such that the fusion moiety is linked in-frame to the protein of the 

invention. 



4.8 GENE THERAPY 
5 Mutations in the polynucleotides of the invention gene may result in loss of normal 

function of the encoded protein. The invention thus provides gene therapy to restore normal 
activity of the polypeptides of the invention; or to treat disease states involving polypeptides of 
the invention. Delivery of a functional gene encoding polypeptides of the invention to appropriate 
cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly viral vectors 

10 (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of physical DNA 
transfer methods (e.g., liposomes or chemical treatments). See, for example, Anderson, Nature, 
supplement to vol. 392, no. 6679, pp.25-20 (1998). For additional reviews of gene therapy 
technology see Friedmann, Science, 244: 1275-1281 (1989); Verma, Scientific American: 68-84 
(1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any one of the nucleotides of 

15 the present invention or a gene encoding the polypeptides of the present invention can also be 
accomplished with extrachromosomal substrates (transient expression) or artificial chromosomes 
(stable expression). Cells may also be cultured ex vivo in the presence of proteins of the present 
invention in order to proliferate or to produce a desired effect on or activity in such cells. Treated 
cells can then be introduced in vivo for therapeutic purposes. Alternatively, it is contemplated that 

20 in other human disease states, preventing the expression of or inhibiting the activity of 

polypeptides of the invention will be useful in treating the disease states. It is contemplated that 
antisense therapy or gene therapy could be applied to negatively regulate the expression of 
polypeptides of the invention. 

Other methods inhibiting expression of a protein include the introduction of antisense 

25 molecules to the nucleic acids of the present invention, their complements, or their translated RNA 
sequences, by methods known in the art. Further, the polypeptides of the present invention can be 
inhibited by using targeted deletion methods, or the insertion of a negative regulatory element such 
as a silencer, which is tissue specific. 

The present invention still further provides cells genetically engineered in vivo to express the 

30 polynucleotides of the invention, wherein such polynucleotides are in operative association with a 
regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in 
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the cell. These methods can be used to increase or decrease the expression of the polynucleotides of 

the present invention. 

Knowledge of DNA sequences provided by the invention allows for modification of cells to 
permit, increase, or decrease, expression of endogenous polypeptide. Cells can be modified (e.g., by 
5 homologous recombination) to provide increased polypeptide expression by replacing, in whole or in 
part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells 
express the protein at higher levels. The heterologous promoter is inserted in such a manner that it is 
operatively linked to the desired protein encoding sequences. See, for example, PCT International 
Publication No. WO 94/12650, PCT International Publication No. WO 92/20808, and PCT 

1 0 International Publication No. WO 91/09955. It is also contemplated that, in addition to heterologous 
promoter DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which 
encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron 
DNA may be inserted along with the heterologous promoter DNA. If linked to the desired protein 
coding sequence, amplification of the marker DNA by standard selection methods results in 

1 5 co-amplification of the desired protein coding sequences in the cells. 

In another embodiment of the present invention, cells and tissues may be engineered to 
express an endogenous gene comprising the polynucleotides of the invention under the control of 
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may 
be replaced by homologous recombination. As described herein, gene targeting can be used to 

20 replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene 
or a novel regulatory sequence synthesized by genetic engineering methods. Such regulatory 
sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative 
regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations 
of said sequences. Alternatively, sequences which affect the structure or stability of the RNA or 

25 protein produced may be replaced, removed, added, or otherwise modified by targeting. These 

sequences include polyadenylation signals, mRNA stability elements, splice sites, leader sequences 
for enhancing or modifying transport or secretion properties of the protein, or other sequences which 
alter or improve the function or stability of protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the gene 

3 0 under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both 
upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory 
element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the 
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targeting event may replace an existing element; for example, a tissue-specific enhancer can be 

replaced by an enhancer that has broader or different cell-type specificity than the naturally occurring 

elements. Here, the naturally occurring sequences are deleted and new sequences are added. In all 

cases, the identification of the targeting event may be facilitated by the use of one or more selectable 

5 marker genes that are contiguous with the targeting DNA, allowing for the selection of cells in which 

the exogenous DNA has integrated into the cell genome. The identification of the targeting event 

may also be facilitated by the use of one or more marker genes exhibiting the property of negative 

selection, such that the negatively selectable marker is linked to the exogenous DNA, but configured 

such that the negatively selectable marker flanks the targeting sequence, and such that a correct 

10 homologous recombination event with sequences in the host cell genome does not result in the stable 
integration of the negatively selectable marker. Markers useful for this purpose include the Herpes 
Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with this 

1 5 aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to Chappel; U.S. 
Patent No. 5,578,461 to Sherwin et al.; International Application No. PCT/US92/09627 
(WO93/09222) by Selden et al.; and International Application No. PCT/US90/06436 (WO91/06667) 
by Skoultchi et al., each of which is incorporated by reference herein in its entirety. 

20 4.9 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the invention 
in vivo, one or more genes provided by the invention are either over expressed or inactivated in 
the germ line of animals using homologous recombination [Capecchi, Science 244:1288-1292 
(1989)]. Animals in which the gene is over expressed, under the regulatory control of exogenous 

25 or endogenous promoter elements, are known as transgenic animals. Animals in which an 
endogenous gene has been inactivated by homologous recombination are referred to as 
"knockout" animals. Knockout animals, preferably non-human mammals, can be prepared as 
described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic animals are 
useful to determine the roles polypeptides of the invention play in biological processes, and 

30 preferably in disease states. Transgenic animals are useful as model systems to identify 
compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
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mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 
Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of a promoter of the 
polynucleotides of the invention is either activated or inactivated to alter the level of expression of 
5 the polypeptides of the invention. Inactivation can be carried out using homologous 

recombination methods described above. Activation can be achieved by supplementing or even 
replacing the homologous promoter to provide for increased protein expression. The homologous 
promoter can be supplemented by insertion of one or more heterologous enhancer elements 
known to confer promoter activation in a particular tissue. 

10 The polynucleotides of the present invention also make possible the development, through, 

e.g., homologous recombination or knock out strategies, of animals that fail to express 
polypeptides of the invention or that express a variant polypeptide. Such animals are useful as 
models for studying the in vivo activities of polypeptide as well as for studying modulators of the 
polypeptides of the invention. 

1 5 In preferred methods to determine biological functions of the polypeptides of the invention 

in vivo, one or more genes provided by the invention are either over expressed or inactivated in 
the germ line of animals using homologous recombination [Capecchi, Science 244:1288-1292 
(1989)]. Animals in which the gene is over expressed, under the regulatory control of exogenous 
or endogenous promoter elements, are known as transgenic animals. Animals in which an 

20 endogenous gene has been inactivated by homologous recombination are referred to as 

"knockout" animals. Knockout animals, preferably non-human mammals, can be prepared as 
described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic animals are 
useful to determine the roles polypeptides of the invention play in biological processes, and 
preferably in disease states. Transgenic animals are useful as model systems to identify 

25 compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 
Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of the polynucleotides of the 
invention promoter is either activated or inactivated to alter the level of expression of the 

30 polypeptides of the invention. Inactivation can be carried out using homologous recombination 
methods described above. Activation can be achieved by supplementing or even replacing the 
homologous promoter to provide for increased protein expression. The homologous promoter can 
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be supplemented by insertion of one or more heterologous enhancer elements known to confer 

promoter activation in a particular tissue. 



4,10 USES AND BIOLOGICAL ACTIVITY 

5 The polynucleotides and proteins of the present invention are expected to exhibit one or 

more of the uses or biological activities (including those associated with assays cited herein) 
identified herein. Uses or activities described for proteins of the present invention may be 
provided by administration or use of such proteins or of polynucleotides encoding such proteins 
(such as, for example, in gene therapies or vectors suitable for introduction of DNA). The 

1 0 mechanism underlying the particular condition or pathology will dictate whether the polypeptides 
of the invention, the polynucleotides of the invention or modulators (activators or inhibitors) 
thereof would be beneficial to the subject in need of treatment. Thus, "therapeutic compositions 
of the invention" include compositions comprising isolated polynucleotides (including 
recombinant DNA molecules, cloned genes and degenerate variants thereof) or polypeptides of 

15 the invention (including full length protein, mature protein and truncations or domains thereof), or 
compounds and other substances that modulate the overall activity of the target gene products, 
either at the level of target gene/protein expression or target protein activity. Such modulators 
include polypeptides, analogs, (variants), including fragments and fusion proteins, antibodies and 
other binding proteins; chemical compounds that directly or indirectly activate or inhibit the 

20 polypeptides of the invention (identified, e.g., via drug screening assays as described herein); 

antisense polynucleotides and polynucleotides suitable for triple helix formation; and in particular 
antibodies or other binding partners that specifically recognize one or more epitopes of the 
polypeptides of the invention. 

The polypeptides of the present invention may likewise be involved in cellular activation 

25 or in one of the other physiological pathways described herein. 

4.10.1 RESEARCH USES AND UTILITIES 

The polynucleotides provided by the present invention can be used by the research 
community for various purposes. The polynucleotides can be used to express recombinant protein 
30 for analysis, characterization or therapeutic use; as markers for tissues in which the corresponding 
protein is preferentially expressed (either constitutively or at a particular stage of tissue 
differentiation or development or in disease states); as molecular weight markers on gels; as 
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chromosome markers or tags (when labeled) to identify chromosomes or to map related gene 
positions; to compare with endogenous DNA sequences in patients to identify potential genetic 
disorders; as probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known 
5 sequences in the process of discovering other novel polynucleotides; for selecting and making 
oligomers for attachment to a "gene chip" or other support, including for examination of 
expression patterns; to raise anti-protein antibodies using DNA immunization techniques; and as 
an antigen to raise anti-DNA antibodies or elicit another immune response. Where the 
polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for 

10 example, in a receptor-ligand interaction), the polynucleotide can also be used in interaction trap 
assays (such as, for example, that described in Gyuris et al, Cell 75:791-803 (1993)) to identify 
polynucleotides encoding the other protein with which binding occurs or to identify inhibitors of 
the binding interaction. 

The polypeptides provided by the present invention can similarly be used in assays to 

1 5 determine biological activity, including in a panel of multiple proteins for high-throughput 
screening; to raise antibodies or to elicit another immune response; as a reagent (including the 
labeled reagent) in assays designed to quantitatively determine levels of the protein (or its 
receptor) in biological fluids; as markers for tissues in which the corresponding polypeptide is 
preferentially expressed (either constitutively or at a particular stage of tissue differentiation or 

20 development or in a disease state); and, of course, to isolate correlative receptors or ligands. 
Proteins involved in these binding interactions can also be used to screen for peptide or small 
molecule inhibitors or agonists of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent grade or 
kit format for commercialization as research products. 

25 Methods for performing the uses listed above are well known to those skilled in the art. 

References disclosing such methods include without limitation "Molecular Cloning: A Laboratory 
Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch and T. 
Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning Techniques", 
Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987. 

30 

4.10.2 NUTRITIONAL USES 
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Polynucleotides and polypeptides of the present invention can also be used as nutritional 

sources or supplements. Such uses include without limitation use as a protein or amino acid 

supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In 

such cases the polypeptide or polynucleotide of the invention can be added to the feed of a particular 

5 organism or can be administered as a separate solid or liquid preparation, such as in the form of 

powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the polypeptide or 

polynucleotide of the invention can be added to the medium in or on which the microorganism is 

cultured. 



1 0 4.10.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 

ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, cell 
proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting) 
activity or may induce production of other cytokines in certain cell populations. A polynucleotide 

15 of the invention can encode a polypeptide exhibiting such attributes. Many protein factors 
discovered to date, including all known cytokines, have exhibited activity in one or more 
factor-dependent cell proliferation assays, and hence the assays serve as a convenient 
confirmation of cytokine activity. The activity of therapeutic compositions of the present 
invention is evidenced by any one of a number of routine factor dependent cell proliferation 

20 assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/1 1, BaF3, 
MC9/G, M+(preB M+), 2E8, RB5, DAI, 123, Tl 165, HT2, CTLL2, TF-1, Mo7e, CMK, 
HUVEC, and Caco. Therapeutic compositions of the invention can be used in the following: 

Assays for T-cell or thymocyte proliferation include without limitation those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. 

25 Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In 
Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol. 
145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Bertagnolli, et 
al., I. Immunol. 149:3778-3783, 1992; Bowman et al., I. Immunol. 152:1756-1761, 1994. 

30 Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or 

thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, 
Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. 
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Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of mouse and 

human interleukin-y, Schreiber, R. D. In Current Protocols in Immunology. J. E. e.a. Coligan eds. 

Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells 
5 include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 

and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in 

Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991; 

deVries et al., J. Exp. Med. 173:1205-121 1, 1991; Moreau et al., Nature 336:690-692, 1988; 

Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse and 
10 human interleukin 6-Nordan, R. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 

pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al., Proc. Natl. Aced. Sci. U.S.A. 

83:1857-1861, 1986; Measurement of human Interleukin 1 1 --Bennett, F., Giannotti, J., Clark, S. 

C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.15.1 

John Wiley and Sons, Toronto. 1991; Measurement of mouse and human Interleukin 9-Ciarletta, 
15 A., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan 

eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991. 

Assays for T-cell clone responses to antigens (which will identify, among others, proteins 

that affect APC-T cell interactions as well as direct T-cell effects by measuring proliferation and 

cytokine production) include, without limitation, those described in: Current Protocols in 
20 Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W Strober, 

Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse 

Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7, Immunologic 

studies in Humans); Weinberger et al., Proc. Natl. Acad. Sci. USA 77:6091-6095, 1980; 

Weinberger et al, Eur. J. Immun. 11:405-411, 1981; Takai et al., J. Immunol. 137:3494-3500, 
25 1986; Takai et al, J. Immunol. 140:508-512, 1988. 



4.10.4 STEM CELL GROWTH FACTOR ACTIVITY 

A polypeptide of the present invention may exhibit stem cell growth factor activity and be 
involved in the proliferation, differentiation and survival of pluripotent and totipotent stem cells 
30 including primordial germ cells, embryonic stem cells, hematopoietic stem cells and/or germ line 
stem cells. Administration of the polypeptide of the invention to stem cells in vivo or ex vivo is 
expected to maintain and expand cell populations in a totipotential or pluripotential state which 
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would be useful for re-engineering damaged or diseased tissues, transplantation, manufacture of 
bio-pharmaceuticals and the development of bio-sensors. The ability to produce large quantities 
of human cells has important working applications for the production of human proteins which 
currently must be obtained from non-human sources or donors, implantation of cells to treat 
5 diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases; tissues for 
grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including cardiac muscle), 
blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs for transplantation 
such as kidney, liver, pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or cytokines may 

1 0 be administered in combination with the polypeptide of the invention to achieve the desired effect, 
including any of the growth factors listed herein, other stem cell maintenance factors, and 
specifically including stem cell factor (SCF), leukemia inhibitory factor (LLF), Flt-3 ligand (Flt- 
3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, macrophage 
inflammatory protein 1 -alpha (MIP-1 -alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet 

15 factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors and basic fibroblast 
growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, expansion of 
these cells in culture will facilitate the production of large quantities of mature cells. Techniques 
for culturing stem cells are known in the art and administration of polypeptides of the invention, 

20 optionally with other growth factors and/or cytokines, is expected to enhance the survival and 
proliferation of the stem cell populations. This can be accomplished by direct administration of 
the polypeptide of the invention to the culture medium. Alternatively, stroma cells transfected 
with a polynucleotide that encodes for the polypeptide of the invention can be used as a feeder 
layer for the stem cell populations in culture or in vivo. Stromal support cells for feeder layers 

25 may include embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or 
cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926). 

Stem cells themselves can be transfected with a polynucleotide of the invention to induce 
autocrine expression of the polypeptide of the invention. This will allow for generation of 
undifferentiated totipotential/pluripotential stem cell lines that are useful as is or that can then be 

30 differentiated into the desired mature cell types. These stable cell lines can also serve as a source 
of undifferentiated totipotential/pluripotential mRNA to create cDNA libraries and templates for 
polymerase chain reaction experiments. These studies would allow for the isolation and 
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identification of differentially expressed genes in stem cell populations that regulate stem cell 
proliferation and/or maintenance. 

Expansion and maintenance of totipotent stem cell populations will be useful in the 
treatment of many pathological conditions. For example, polypeptides of the present invention 
5 may be used to manipulate stem cells in culture to give rise to neuroepithelial cells that can be 
used to augment or replace cells damaged by illness, autoimmune disease, accidental damage or 
genetic disorders. The polypeptide of the invention may be useful for inducing the proliferation 
of neural cells and for the regeneration of nerve and brain tissue, i.e. for the treatment of central 
and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic 

10 disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In addition, 
the expanded stem cell populations can also be genetically altered for gene therapy purposes and 
to decrease host rejection of replacement tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can also be 
manipulated to achieve controlled differentiation of the stem cells into more differentiated cell 

1 5 types. A broadly applicable method of obtaining pure populations of a specific differentiated cell 
type from undifferentiated stem cell populations involves the use of a cell-type specific promoter 
driving a selectable marker. The selectable marker allows only cells of the desired type to 
survive. For example, stem cells can be induced to differentiate into cardiomyocytes (Wobus et 
al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998)) or 

20 skeletal muscle cells (Browder, L. W. In: Principles of Tissue Engineering eds. Lanza et al., 

Academic Press (1997)). Alternatively, directed differentiation of stem cells can be accomplished 
by culturing the stem cells in the presence of a differentiation factor such as retinoic acid and an 
antagonist of the polypeptide of the invention which would inhibit the effects of endogenous stem 
cell factor activity and allow differentiation to proceed. 

25 In vitro cultures of stem cells can be used to determine if the polypeptide of the invention 

exhibits stem cell growth factor activity. Stem cells are isolated from any one of various cell 
sources (including hematopoietic stem cells and embryonic stem cells) and cultured on a feeder 
layer, as described by Thompson et al. Proc. Natl. Acad. Sci, U.S.A., 92: 7844-7848 (1995), in the 
presence of the polypeptide of the invention alone or in combination with other growth factors or 

30 cytokines. The ability of the polypeptide of the invention to induce stem cells proliferation is 

determined by colony formation on semi-solid support e.g. as described by Bernstein et al., Blood, 
77:2316-2321 (1991). 
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4.10.5 HEMATOPOIESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of hematopoiesis 
and, consequently, in the treatment of myeloid or lymphoid cell disorders. Even marginal 
5 biological activity in support of colony forming cells or of factor-dependent cell lines indicates 
involvement in regulating hematopoiesis, e.g. in supporting the growth and proliferation of 
erythroid progenitor cells alone or in combination with other cytokines, thereby indicating utility, 
for example, in treating various anemias or for use in conjunction with irradiation/chemotherapy 
to stimulate the production of erythroid precursors and/or erythroid cells; in supporting the growth 

10 and proliferation of myeloid cells such as granulocytes and monocytes/macrophages (i.e., 

traditional CSF activity) useful, for example, in conjunction with chemotherapy to prevent or treat 
consequent myelo-suppression; in supporting the growth and proliferation of megakaryocytes and 
consequently of platelets thereby allowing prevention or treatment of various platelet disorders 
such as thrombocytopenia, and generally for use in place of or complimentary to platelet 

15 transfusions; and/or in supporting the growth and proliferation of hematopoietic stem cells which 
are capable of maturing to any and all of the above-mentioned hematopoietic cells and therefore 
find therapeutic utility in various stem cell disorders (such as those usually treated with 
transplantation, including, without limitation, aplastic anemia and paroxysmal nocturnal 
hemoglobinuria), as well as in repopulating the stem cell compartment post 

20 irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow 

transplantation or with peripheral progenitor cell transplantation (homologous or heterologous)) 
as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 

Suitable assays for proliferation and differentiation of various hematopoietic lines are cited 

25 above. 

Assays for embryonic stem cell differentiation (which will identify, among others, proteins 
that influence embryonic differentiation hematopoiesis) include, without limitation, those 
described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., Molecular and 
Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 81:2903-2915, 1993. 
30 Assays for stem cell survival and differentiation (which will identify, among others, 

proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: 
Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I. 
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Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; Hirayama et al., 
Proc. Natl. Acad. Sci. USA 89:5907-591 1, 1992; Primitive hematopoietic colony forming cells 
with high proliferative potential, McNiece, I. K. and Briddell, R. A. In Culture of Hematopoietic 
Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et 
5 al, Experimental Hematology 22:353-359, 1994; Cobblestone area forming cell assay, 

Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 1-21, 
Wiley-Liss, Inc., New York, N.Y. 1994; Long term bone marrow cultures in the presence of 
stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. L 
Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, N.Y. 1994; Long term culture 
10 initiating cell assay, Sutherland, H. J. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. 
Vol pp. 139-162, Wiley-Liss, Inc., New York, N.Y. 1994. 



4.10.6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, tendon, 
15 ligament and/or nerve tissue growth or regeneration, as well as in wound healing and tissue repair 
and replacement, and in healing of burns, incisions and ulcers. 

A polypeptide of the present invention which induces cartilage and/or bone growth in 
circumstances where bone is not normally formed, has application in the healing of bone fractures 
and cartilage damage or defects in humans and other animals. Compositions of a polypeptide, 
20 antibody, binding partner, or other modulator of the invention may have prophylactic use in 

closed as well as open fracture reduction and also in the improved fixation of artificial joints. De 
novo bone formation induced by an osteogenic agent contributes to the repair of congenital, 
trauma induced, or oncologic resection induced craniofacial defects, and also is useful in cosmetic 
plastic surgery. 

25 A polypeptide of this invention may also be involved in attracting bone-forming cells, 

stimulating growth of bone-forming cells, or inducing differentiation of progenitors of 
bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 
periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking 
inflammation or processes of tissue destruction (collagenase activity, osteoclast activity, etc.) 

30 mediated by inflammatory processes may also be possible using the composition of the invention. 

Another category of tissue regeneration activity that may involve the polypeptide of the 
present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue or other 
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tissue formation in circumstances where such tissue is not normally formed, has application in the 

healing of tendon or ligament tears, deformities and other tendon or ligament defects in humans 

and other animals. Such a preparation employing a tendon/ligament-like tissue inducing protein 

may have prophylactic use in preventing damage to tendon or ligament tissue, as well as use in 

5 the improved fixation of tendon or ligament to bone or other tissues, and in repairing defects to 

tendon or ligament tissue. De novo tendon/ligament-like tissue formation induced by a 

composition of the present invention contributes to the repair of congenital, trauma induced, or 

other tendon or ligament defects of other origin, and is also useful in cosmetic plastic surgery for 

attachment or repair of tendons or ligaments. The compositions of the present invention may 

10 provide environment to attract tendon- or ligament-forming cells, stimulate growth of tendon- or 
ligament-forming cells, induce differentiation of progenitors of tendon- or ligament-forming cells, 
or induce growth of tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue 
repair. The compositions of the invention may also be useful in the treatment of tendinitis, carpal 
tunnel syndrome and other tendon or ligament defects. The compositions may also include an 

15 appropriate matrix and/or sequestering agent as a carrier as is well known in the art. 

The compositions of the present invention may also be useful for proliferation of neural 
cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral 
nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which 
involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a 

20 composition may be used in the treatment of diseases of the peripheral nervous system, such as 
peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous 
system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 
lateral sclerosis, and Shy-Drager syndrome. Further conditions which may be treated in 
accordance with the present invention include mechanical and traumatic disorders, such as spinal 

25 cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies 
resulting from chemotherapy or other medical therapies may also be treatable using a composition 
of the invention. 

Compositions of the invention may also be useful to promote better or faster closure of 
non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular 
30 insufficiency, surgical and traumatic wounds, and the like. 

Compositions of the present invention may also be involved in the generation or 
regeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine, 
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kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular 
endothelium) tissue, or for promoting the growth of cells comprising such tissues. Part of the 
desired effects may be by inhibition or modulation of fibrotic scarring may allow normal tissue to 
regenerate. A polypeptide of the present invention may also exhibit angiogenic activity. 
5 A composition of the present invention may also be useful for gut protection or 

regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and 
conditions resulting from systemic cytokine damage. 

A composition of the present invention may also be useful for promoting or inhibiting 
differentiation of tissues described above from precursor tissues or cells; or for inhibiting the 
10 growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 

Assays for tissue generation activity include, without limitation, those described in: 
International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International Patent 
Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. 
15 WO91/07491 (skin, endothelium). 

Assays for wound healing activity include, without limitation, those described in: Winter, 
Epidermal Wound Healing, pps. 71-1 12 (Maibach, H. I. and Rovee, D. T., eds.), Year Book 
Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol 
71:382-84(1978). 

20 

4.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or immune 
suppressing activity, including without limitation the activities for which assays are described 
herein. A polynucleotide of the invention can encode a polypeptide exhibiting such activities. A 

25 protein may be useful in the treatment of various immune deficiencies and disorders (including 
severe combined immunodeficiency (SCID)), e.g., in regulating (up or down) growth and 
proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells and 
other cell populations. These immune deficiencies may be genetic or be caused by viral (e.g., 
HIV) as well as bacterial or fungal infections, or may result from autoimmune disorders. More 

30 specifically, infectious diseases causes by viral, bacterial, fungal or other infection may be 

treatable using a protein of the present invention, including infections by HIV, hepatitis viruses, 
herpes viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such as 

50 
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candidiasis. Of course, in this regard, proteins of the present invention may also be useful where 

a boost to the immune system generally may be desirable, i.e., in the treatment of cancer. 

Autoimmune disorders which may be treated using a protein of the present invention 

include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus, 

5 rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome, 

autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, graft-versus-host 

disease and autoimmune inflammatory eye disease. Such a protein (or antagonists thereof, 

including antibodies) of the present invention may also to be useful in the treatment of allergic 

reactions and conditions {e.g., anaphylaxis, serum sickness, drug reactions, food allergies, insect 

1 0 venom allergies, mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria, 

angioedema, eczema, atopic dermatitis, allergic contact dermatitis, erythema multiforme, Stevens- 
Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, venereal 
keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma 
(particularly allergic asthma) or other respiratory problems. Other conditions, in which immune 

15 suppression is desired (including, for example, organ transplantation), may also be treatable using 
a protein (or antagonists thereof) of the present invention. The therapeutic effects of the 
polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo animals 
models such as the cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66, 
1998), skin prick test (Hoffinann et al., Allergy 54: 446-54, 1999), guinea pig skin sensitization 

20 test (Vohr et al., Arch. Toxocol. 73: 501-9), and murine local lymph node assay (Kimber et al., J. 
Toxicol. Environ. Health 53: 563-79). 

Using the proteins of the invention it may also be possible to modulate immune responses, 
in a number of ways. Down regulation may be in the form of inhibiting or blocking an immune 
response already in progress or may involve preventing the induction of an immune response. The 

25 functions of activated T cells may be inhibited by suppressing T cell responses or by inducing 
specific tolerance in T cells, or both. Immunosuppression of T cell responses is generally an 
active, non-antigen-specific, process which requires continuous exposure of the T cells to the 
suppressive agent. Tolerance, which involves inducing non-responsiveness or anergy in T cells, is 
distinguishable from immunosuppression in that it is generally antigen-specific and persists after 

30 exposure to the tolerizing agent has ceased. Operationally, tolerance can be demonstrated by the 
lack of a T cell response upon reexposure to specific antigen in the absence of the tolerizing 
agent. 
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Down regulating or preventing one or more antigen functions (including without limitation 

B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high level 

lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and organ 

transplantation and in graft- versus-host disease (GVHD). For example, blockage of T cell 

5 function should result in reduced tissue destruction in tissue transplantation. Typically, in tissue 

transplants, rejection of the transplant is initiated through its recognition as foreign by T cells, 

followed by an immune reaction that destroys the transplant. The administration of a therapeutic 

composition of the invention may prevent cytokine synthesis by immune cells, such as T cells, 

and thus acts as an immunosuppressant. Moreover, a lack of costimulation may also be sufficient 

10 to anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term tolerance 
by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration of 
these blocking reagents. To achieve sufficient immunosuppression or tolerance in a subject, it 
may also be necessary to block the function of a combination of B lymphocyte antigens. 
The efficacy of particular therapeutic compositions in preventing organ transplant 

15 rejection or GVHD can be assessed using animal models that are predictive of efficacy in humans. 
Examples of appropriate systems which can be used include allogeneic cardiac grafts in rats and 
xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine the 
immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et al., 
Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. Sci USA, 89:1 1 102-1 1 105 (1992). 

20 In addition, murine models of GVHD (see Paul ed., Fundamental Immunology, Raven Press, New 
York, 1989, pp. 846-847) can be used to determine the effect of therapeutic compositions of the 
invention on the development of that disease. 

Blocking antigen function may also be therapeutically useful for treating autoimmune 
diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that are 

25 reactive against self tissue and which promote the production of cytokines and autoantibodies 
involved in the pathology of the diseases. Preventing the activation of autoreactive T cells may 
reduce or eliminate disease symptoms. Administration of reagents which block stimulation of T 
cells can be used to inhibit T cell activation and prevent production of autoantibodies or T 
cell-derived cytokines which may be involved in the disease process. Additionally, blocking 

30 reagents may induce antigen-specific tolerance of autoreactive T cells which could lead to 

long-term relief from the disease. The efficacy of blocking reagents in preventing or alleviating 
autoimmune disorders can be determined using a number of well-characterized animal models of 
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human autoimmune diseases. Examples include murine experimental autoimmune encephalitis, 
systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune 
collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental 
myasthenia gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 
5 840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a means of 
up regulating immune responses, may also be useful in therapy. Upregulation of immune 
responses may be in the form of enhancing an existing immune response or eliciting an initial 
immune response. For example, enhancing an immune response may be useful in cases of viral 

10 infection, including systemic viral diseases such as influenza, the common cold, and encephalitis. 
Alternatively, anti-viral immune responses may be enhanced in an infected patient by 
removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed 
APCs either expressing a peptide of the present invention or together with a stimulatory form of a 
soluble peptide of the present invention and reintroducing the in vitro activated T cells into the 

15 patient. Another method of enhancing anti- viral immune responses would be to isolate infected 
cells from a patient, transfect them with a nucleic acid encoding a protein of the present invention 
as described herein such that the cells express all or a portion of the protein on their surface, and 
reintroduce the transfected cells into the patient. The infected cells would now be capable of 
delivering a costimulatory signal to, and thereby activate, T cells in vivo. 

20 A polypeptide of the present invention may provide the necessary stimulation signal to T 

cells to induce a T cell mediated immune response against the transfected tumor cells. In addition, 
tumor cells which lack MHC class I or MHC class II molecules, or which fail to reexpress 
sufficient mounts of MHC class I or MHC class II molecules, can be transfected with nucleic acid 
encoding all or a portion of (e.g., a cytoplasmic-domain truncated portion) of an MHC class I 

25 alpha chain protein and P2 microglobulin protein or an MHC class II alpha chain protein and an 
MHC class II beta chain protein to thereby express MHC class I or MHC class II proteins on the 
cell surface. Expression of the appropriate class I or class II MHC in conjunction with a peptide 
having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T cell mediated 
immune response against the transfected tumor cell. Optionally, a gene encoding an antisense 

30 construct which blocks expression of an MHC class II associated protein, such as the invariant 
chain, can also be cotransfected with a DNA encoding a peptide having the activity of a B 
lymphocyte antigen to promote presentation of tumor associated antigens and induce tumor 
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specific immunity. Thus, the induction of a T cell mediated immune response in a human subject 
may be sufficient to overcome tumor-specific tolerance in the subject. 

The activity of a protein of the invention may, among other means, be measured by the 
following methods: 

5 Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, those 

described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. 
Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience 
(Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic 
studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488-2492, 1981; Herrmann 

10 et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. Immunol. 135:1564-1572, 1985; Takai et 
al., I. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988; Bowman et 
al., J. Virology 61:1992-1998; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Brown 
et al., J. Immunol. 153:3079-3092, 1994. 

Assays for T-cell-dependent immunoglobulin responses and isotype switching (which will 

15 identify, among others, proteins that modulate T-cell dependent antibody responses and that affect 
Thl/Th2 profiles) include, without limitation, those described in: Maliszewski, J. Immunol. 
144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, Mond, J. J. 
and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 
3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

20 Mixed lymphocyte reaction (MLR) assays (which will identify, among others, proteins 

that generate predominantly Thl and CTL responses) include, without limitation, those described 
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 

25 Humans); Takai et al, J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 
1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins expressed by 
dendritic cells that activate naive T-cells) include, without limitation, those described in: Guery et 
al., J. Immunol. 134:536-544, 1995; Inabaet al., Journal of Experimental Medicine 173:549-559, 

30 1991; Macatonia et al., Journal of Immunology 154:5071-5079, 1995; Porgador et al., Journal of 
Experimental Medicine 182:255-260, 1995; Nair et al., Journal of Virology 67:4062-4069, 1993; 
Huang et al., Science 264:961-965, 1994; Macatonia et al., Journal of Experimental Medicine 
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169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical Investigation 94:797-807, 1994; and 
Inaba et al., Journal of Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins that 
prevent apoptosis after superantigen induction and proteins that regulate lymphocyte homeostasis) 
5 include, without limitation, those described in: Darzynkiewicz et al., Cytometry 13:795-808, 
1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Research 
53:1945-1951, 1993; Itoh et al, Cell 66:233-243, 1991; Zacharchuk, Journal of Immunology 
145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al., International 
Journal of Oncology 1:639-648, 1992. 
10 Assays for proteins that influence early steps of T-cell commitment and development 

include, without limitation, those described in: Antica et al., Blood 84:1 1 1-1 17, 1994; Fine et al, 
Cellular Immunology 155:1 1 1-122, 1994; Galy et al, Blood 85:2770-2778, 1995; Toki et al, 
Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 



15 4.10.8 ACTIVIN/INHIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related 
activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to stimulate the 

20 release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present invention, alone 
or in heterodimers with a member of the inhibin family, may be useful as a contraceptive based on 
the ability of inhibins to decrease fertility in female mammals and decrease spermatogenesis in 
male mammals. Administration of sufficient amounts of other inhibins can induce infertility in 
these mammals. Alternatively, the polypeptide of the invention, as a homodimer or as a 

25 heterodimer with other protein subunits of the inhibin group, may be useful as a fertility inducing 
therapeutic, based upon the ability of activin molecules in stimulating FSH release from cells of 
the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A polypeptide of the invention 
may also be useful for advancement of the onset of fertility in sexually immature mammals, so as 
to increase the lifetime reproductive performance of domestic animals such as, but not limited to, 

30 cows, sheep and pigs. 

The activity of a polypeptide of the invention may, among other means, be measured by 
the following methods. 
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Assays for activin/inhibin activity include, without limitation, those described in: Vale et 
ah, Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et al., Nature 
321:776-779, 1986; Mason et al, Nature 318:659-663, 1985; Forage et al., Proc. Natl Acad. Sci. 
USA 83:3091-3095,1986. 

5 

4.10,9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY 

A polypeptide of the present invention may be involved in chemotactic or chemokinetic 
activity for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils, T-cells, 
mast cells, eosinophils, epithelial and/or endothelial cells. A polynucleotide of the invention can 

10 encode a polypeptide exhibiting such attributes. Chemotactic and chemokinetic receptor 

activation can be used to mobilize or attract a desired cell population to a desired site of action. 
Chemotactic or chemokinetic compositions (e.g. proteins, antibodies, binding partners, or 
modulators of the invention) provide particular advantages in treatment of wounds and other 
trauma to tissues, as well as in treatment of localized infections. For example, attraction of 

15 lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in improved 
immune responses against the tumor or infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it can 
stimulate, directly or indirectly, the directed orientation or movement of such cell population. 
Preferably, the protein or peptide has the ability to directly stimulate directed movement of cells. 

20 Whether a particular protein has chemotactic activity for a population of cells can be readily 
determined by employing such protein or peptide in any known assay for cell chemotaxis. 
Therapeutic compositions of the invention can be used in the following: 
Assays for chemotactic activity (which will identify proteins that induce or prevent 
chemotaxis) consist of assays that measure the ability of a protein to induce the migration of cells 

25 across a membrane as well as the ability of a protein to induce the adhesion of one cell population 
to another cell population. Suitable assays for movement and adhesion include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. 
H. Marguiles, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta Chemokines 6.12.1-6.12.28; 

30 Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. APMIS 103:140-146, 1995; Muller et 
al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860-5867, 1994; Johnston et 
al. J. of Immunol. 153:1762-1768, 1994. 



WO 02/077180 



PCT/US02/08964 



4.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostatis or thrombolysis or 
thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such 
5 attributes. Compositions may be useful in treatment of various coagulation disorders (including 
hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events 
in treating wounds resulting from trauma, surgery or other causes. A composition of the invention 
may also be useful for dissolving or inhibiting formation of thromboses and for treatment and 
prevention of conditions resulting therefrom (such as, for example, infarction of cardiac and 
1 0 central nervous system vessels (e.g., stroke). 

Therapeutic compositions of the invention can be used in the following: 
Assay for hemostatic and thrombolytic activity include, without limitation, those described 
in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis Res. 
45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 
15 35:467-474, 1988. 

4.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, proliferation or 
metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the 

20 invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. For 
example, the presence or increased expression of a polynucleotide/polypeptide of the invention 
may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing malignancy. 
Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancer 
condition. Identification of single nucleotide polymorphisms associated with cancer or a 

25 predisposition to cancer may also be useful for diagnosis or prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 
inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor growth) 
and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. Therapeutic 
compositions of the invention may be effective in adult and pediatric oncology including in solid 

30 phase tumors/malignancies, locally advanced tumors, human soft tissue sarcomas, metastatic 
cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma, 
acute and chronic leukemias, and lymphomas, head and neck cancers including mouth cancer, 
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larynx cancer and thyroid cancer, lung cancers including small cell carcinoma and non-small cell 
cancers, breast cancers including small cell carcinoma and ductal carcinoma, gastrointestinal 
cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps 
associated with colorectal neoplasia, pancreatic cancers, liver cancer, urologic cancers including 
5 bladder cancer and prostate cancer, malignancies of the female genital tract including ovarian 
carcinoma, uterine (including endometrial) cancers, and solid tumor in the ovarian follicle, kidney 
cancers including renal cell carcinoma, brain cancers including intrinsic brain tumors, 
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central 
nervous system, bone cancers including osteomas, skin cancers including malignant melanoma, 

1 0 tumor progression of human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, 
hemangiopericytoma and Karposi's sarcoma. 

Polypeptides, polynucleotides, or modulators of polypeptides of the invention (including 
inhibitors and stimulators of the biological activity of the polypeptide of the invention) may be 
administered to treat cancer. Therapeutic compositions can be administered in therapeutically 

15 effective dosages alone or in combination with adjuvant cancer therapy such as surgery, 

chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial 
effect, e.g. reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or otherwise 
improving overall clinical condition, without necessarily eradicating the cancer. 

The composition can also be administered in therapeutically effective amounts as a portion 

20 of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or modulator of 
the invention with one or more anti-cancer drugs in addition to a pharmaceutically acceptable 
carrier for delivery. The use of anti-cancer cocktails as a cancer treatment is routine. Anti-cancer 
drugs that are well known in the art and can be used as a treatment in combination with the 
polypeptide or modulator of the invention include: Actinomycin D, Aminoglutethimide, 

25 Asparaginase, Bleomycin, Busulfan, Carboplatin, Carmustine, Chlorambucil, Cisplatin (cis- 

DDP), Cyclophosphamide, Cytarabine HC1 (Cytosine arabinoside), Dacarbazine, Dactinomycin, 
Daunorubicin HC1, Doxorubicin HC1, Estramustine phosphate sodium, Etoposide (VI 6-2 13), 
Floxuridine, 5-Fluorouracil (5-Fu), Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, 
Interferon Alpha-2a, Interferon Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), 

30 Lomustine, Mechlorethamine HC1 (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, 

Methotrexate (MTX), Mitomycin, Mitoxantrone HC1, Octreotide, Plicamycin, Procarbazine HC1, 
Streptozocin, Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate, 
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Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin, 

Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for prophylactic 

treatment of cancer. There are hereditary conditions and/or environmental situations (e.g.. 

5 exposure to carcinogens) known in the art that predispose an individual to developing cancers. 

Under these circumstances, it may be beneficial to treat these individuals with therapeutically 

effective doses of the polypeptide of the invention to reduce the risk of developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of the 

invention as a potential cancer treatment. These in vitro models include proliferation assays of 

10 cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1987) Culture of 
Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, NY Ch 18 and Ch 21), tumor 
systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst., 52: 921-30 (1974), 
mobility and invasive potential of tumor cells in Boyden Chamber assays as described in 
Pilkington et al, Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays such as induction 

15 of vascularization of the chick chorioallantoic membrane or induction of vascular endothelial cell 
migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1 189-97 (1999) and Li et al., Clin. 
Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor cells lines are available, e.g. from 
American Type Tissue Culture Collection catalogs. 

20 4.10.12 RECEPTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, receptor 
ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of the invention 
can encode a polypeptide exhibiting such characteristics. Examples of such receptors and ligands 
include, without limitation, cytokine receptors and their ligands, receptor kinases and their 

25 ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions and 
their ligands (including without limitation, cellular adhesion molecules (such as selectins, 
integrins and their ligands) and receptor/ligand pairs involved in antigen presentation, antigen 
recognition and development of cellular and humoral immune responses. Receptors and ligands 
are also useful for screening of potential peptide or small molecule inhibitors of the relevant 

30 receptor/ligand interaction. A protein of the present invention (including, without limitation, 
fragments of receptors and ligands) may themselves be useful as inhibitors of receptor/ligand 
interactions. 
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The activity of a polypeptide of the invention may, among other means, be measured by 
the following methods: 

Suitable assays for receptor-ligand activity include without limitation those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. 
5 Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- Interscience (Chapter 7.28, 
Measurement of Cellular Adhesion under static conditions 7.28.1- 7.28.22), Takai et al., Proc. 
Natl. Acad. Sci. USA 84:6864-6868, 1987; Biereret al., J. Exp. Med. 168:1145-1156, 1988; 
Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 
175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995. 
10 By way of example, the polypeptides of the invention may be used as a receptor for a 

ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be identified 
through binding assays, affinity chromatography, dihybrid screening assays, BIAcore assays, gel 
overlay assays, or other methods known in the art. 

Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or a 
1 5 partial antagonist require the use of other proteins as competing ligands. The polypeptides of the 
present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes, 
colorimetric molecules or a toxin molecules by conventional methods. ("Guide to Protein 
Purification" Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 (1990) Academic Press, 
Inc. San Diego). Examples of radioisotopes include, but are not limited to, tritium and carbon- 14 . 
20 Examples of colorimetric molecules include, but are not limited to, fluorescent molecules such as 
fluorescamine, or rhodamine or other colorimetric molecules. Examples of toxins include, but are 
not limited, to ricin. 

4.10.13 DRUG SCREENING 

25 This invention is particularly useful for screening chemical compounds by using the novel 

polypeptides or binding fragments thereof in any of a variety of drug screening techniques. The 
polypeptides or fragments employed in such a test may either be free in solution, affixed to a solid 
support, borne on a cell surface or located intracellularly. One method of drug screening utilizes 
eukaryotic or prokaryotic host cells which are stably transformed with recombinant nucleic acids 

30 expressing the polypeptide or a fragment thereof. Drugs are screened against such transformed 
cells in competitive binding assays. Such cells, either in viable or fixed form, can be used for 
standard binding assays. One may measure, for example, the formation of complexes between 
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polypeptides of the invention or fragments and the agent being tested or examine the diminution 
in complex formation between the novel polypeptides and an appropriate cell line, which are well 
known in the art. 

Sources for test compounds that may be screened for ability to bind to or modulate (i.e., 
5 increase or decrease) the activity of polypeptides of the invention include (1) inorganic and 

organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries comprised 
of either random or mimetic peptides, oligonucleotides or organic molecules. 

Chemical libraries may be readily synthesized or purchased from a number of commercial 
sources, and may include structural analogs of known compounds or compounds that are 
10 identified as "hits" or "leads" via natural product screening. 

The sources of natural product libraries are microorganisms (including bacteria and fungi), 
animals, plants or other vegetation, or marine organisms, and libraries of mixtures for screening 
may be created by: (1) fermentation and extraction of broths from soil, plant or marine 
microorganisms or (2) extraction of the organisms themselves. Natural product libraries include 
1 5 polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a review, 
see Science 252:63-68 (1998). 

Combinatorial libraries are composed of large numbers of peptides, oligonucleotides or 
organic compounds and can be readily prepared by traditional automated synthesis methods, PCR, 
cloning or proprietary synthetic methods. Of particular interest are peptide and oligonucleotide 
20 combinatorial libraries. Still other libraries of interest include peptide, protein, peptidomimetic, 
multiparallel synthetic collection, recombinatorial, and polypeptide libraries. For a review of 
combinatorial chemistry and libraries created therefrom, see Myers, Curr. Opin. Biotechnol 
8:701-707 (1997). For reviews and examples of peptidomimetic libraries, see Al-Obeidi et al., 
MoL Biotechnol, 9(3):205-23 (1998); Hruby et al., Curr Opin Chem Biol, 1(1):1 14-19 (1997); 
25 Dorner et al., Bioorg Med Chem, 4(5):709-15 (1996) (alkylated dipeptides). 

Identification of modulators through use of the various libraries described herein permits 
modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to bind a 
polypeptide of the invention. The molecules identified in the binding assay are then tested for 
antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the 
30 art. In brief, the molecules are titrated into a plurality of cell cultures or animals and then tested 
for either cell/animal death or prolonged survival of the animal/cells. 
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The binding molecules thus identified may be complexed with toxins, e.g., ricin or 
cholera, or with other compounds that are toxic to cells such as radioisotopes. The toxin-binding 
molecule complex is then targeted to a tumor or other cell by the specificity of the binding 
molecule for a polypeptide of the invention. Alternatively, the binding molecules may be 
5 complexed with imaging agents for targeting and imaging purposes. 

4.10.14 ASSAY FOR RECEPTOR ACTIVITY 

The invention also provides methods to detect specific binding of a polypeptide e.g. a 
ligand or a receptor. The art provides numerous assays particularly useful for identifying 

10 previously unknown binding partners for receptor polypeptides of the invention. For example, 
expression cloning using mammalian or bacterial cells, or dihybrid screening assays can be used 
to identify polynucleotides encoding binding partners. As another example, affinity 
chromatography with the appropriate immobilized polypeptide of the invention can be used to 
isolate polypeptides that recognize and bind polypeptides of the invention. There are a number of 

15 different libraries used for the identification of compounds, and in particular small molecules, that 
modulate (i.e., increase or decrease) biological activity of a polypeptide of the invention. Ligands 
for receptor polypeptides of the invention can also be identified by adding exogenous ligands, or 
cocktails of ligands to two cells populations that are genetically identical except for the expression 
of the receptor of the invention: one cell population expresses the receptor of the invention 

20 whereas the other does not. The response of the two cell populations to the addition of ligands(s) 
are then compared. Alternatively, an expression library can be co-expressed with the polypeptide 
of the invention in cells and assayed for an autocrine response to identify potential ligand(s). As 
still another example, BLAcore assays, gel overlay assays, or other methods known in the art can 
be used to identify binding partner polypeptides, including, (1) organic and inorganic chemical 

25 libraries, (2) natural product libraries, and (3) combinatorial libraries comprised of random 
peptides, oligonucleotides or organic molecules. 

The role of downstream intracellular signaling molecules in the signaling cascade of the 
polypeptide of the invention can be determined. For example, a chimeric protein in which the 
cytoplasmic domain of the polypeptide of the invention is fused to the extracellular portion of a 

30 protein, whose ligand has been identified, is produced in a host cell. The cell is then incubated 

with the ligand specific for the extracellular portion of the chimeric protein, thereby activating the 
chimeric receptor. Known downstream proteins involved in intracellular signaling can then be 
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assayed for expected modifications i.e. phosphorylation. Other methods known to those in the art 
can also be used to identify signaling molecules involved in receptor activity. 

4.10.15 ANTI-INFLAMMATORY ACTIVITY 

5 Compositions of the present invention may also exhibit anti-inflammatory activity. The 

anti-inflammatory activity may be achieved by providing a stimulus to cells involved in the 
inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example, 
cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory 
process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production of 

10 other factors which more directly inhibit or promote an inflammatory response. Compositions 
with such activities can be used to treat inflammatory conditions including chronic or acute 
conditions), including without limitation intimation associated with infection (such as septic 
shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfiision injury, 
endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 

1 5 chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting from 
over production of cytokines such as TNF or IL-1. Compositions of the invention may also be 
useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 
Compositions of this invention may be utilized to prevent or treat conditions such as, but not 
limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid 

20 arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, 
graft versus host disease, inflammatory bowel disease, inflamation associated with pulmonary 
disease, other autoimmune disease or inflammatory disease, an antiproliferative agent such as for 
acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to 
intrauterine infections. 

25 

4.10.16 LEUKEMIAS 

Leukemias and related disorders may be treated or prevented by administration of a 
therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the 
invention. Such leukemias and related disorders include but are not limited to acute leukemia, 
30 acute lymphocytic leukemia, acute myelocytic leukemia, myeloblasts, promyelocyte, 
myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic myelocytic 
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(granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see 
Fishman et al, 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 



4.10.17 NERVOUS SYSTEM DISORDERS 

5 Nervous system disorders, involving cell types which can be tested for efficacy of 

intervention with compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an indication of 
therapeutic utility, include but are not limited to nervous system injuries, and diseases or disorders 
which result in either a disconnection of axons, a diminution or degeneration of neurons, or 
10 demyelination. Nervous system lesions which may be treated in a patient (including human and 
non-human mammalian patients) according to the invention include but are not limited to the 
following lesions of either the central (including spinal cord, brain) or peripheral nervous systems: 

(i) traumatic lesions, including lesions caused by physical injury or associated with 
surgery, for example, lesions which sever a portion of the nervous system, or compression 

15 injuries; 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system 
results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 
infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or injured 
20 as a result of infection, for example, by an abscess or associated with infection by human 

immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, 
tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or 
injured as a result of a degenerative process including but not limited to degeneration associated 

25 with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral 
sclerosis; 

(v) lesions associated with nutritional diseases or disorders, in which a portion of the 
nervous system is destroyed or injured by a nutritional disorder or disorder of metabolism 
including but not limited to, vitamin B12 deficiency, folic acid deficiency, Wernicke disease, 

30 tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary degeneration of the corpus 
callosum), and alcoholic cerebellar degeneration; 
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(vi) neurological lesions associated with systemic diseases including but not limited to 

diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, carcinoma, or 
sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 
5 neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is destroyed or 
injured by a demyelinating disease including but not limited to multiple sclerosis, human 
immunodeficiency virus-associated myelopathy, transverse myelopathy or various etiologies, 
progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 

10 Therapeutics which are useful according to the invention for treatment of a nervous system 

disorder may be selected by testing for biological activity in promoting the survival or 
differentiation of neurons. For example, and not by way of limitation, therapeutics which elicit 
any of the following effects may be useful according to the invention: 
(i) increased survival time of neurons in culture; 

1 5 (ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, e.g., 
choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 

(iv) decreased symptoms of neuron dysfunction in vivo. 

Such effects may be measured by any method known in the art. In preferred, non-limiting 
20 embodiments, increased survival of neurons may be measured by the method set forth in Arakawa 
et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may be detected by 
methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or Brown et al. (1981, Ann. 
Rev. Neurosci. 4: 17-42); increased production of neuron-associated molecules may be measured 
by bioassay, enzymatic assay, antibody binding, Northern blot assay, etc., depending on the 
25 molecule to be measured; and motor neuron dysfunction may be measured by assessing the 
physical manifestation of motor neuron disorder, e.g., weakness, motor neuron conduction 
velocity, or functional disability. 

In specific embodiments, motor neuron disorders that may be treated according to the 
invention include but are not limited to disorders such as infarction, infection, exposure to toxin, 
30 trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as 
well as other components of the nervous system, as well as disorders that selectively affect 
neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal 
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muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile 

muscular atrophy, progressive bulbar paralysis of childhood (Fazio-Londe syndrome), 

poliomyelitis and the post polio syndrome, and Hereditary Motorsensory Neuropathy 

(Charcot-Marie-Tooth Disease). 

5 

4.10.18 OTHER ACTIVITIES 

A polypeptide of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or function of, or killing, infectious agents, 
including, without limitation, bacteria, viruses, fungi and other parasites; effecting (suppressing or 

10 enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye 
color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape (such 
as, for example, breast augmentation or diminution, change in bone form or shape); effecting 
biorhythms or circadian cycles or rhythms; effecting the fertility of male or female subjects; 
effecting the metabolism, catabolism, anabolism, processing, utilization, storage or elimination of 

1 5 dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other nutritional factors 
or component(s); effecting behavioral characteristics, including, without limitation, appetite, 
libido, stress, cognition (including cognitive disorders), depression (including depressive 
disorders) and violent behaviors; providing analgesic effects or other pain reducing effects; 
promoting differentiation and growth of embryonic stem cells in lineages other than 

20 hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, correcting 
deficiencies of the enzyme and treating deficiency-related diseases; treatment of 
hyperproliferative disorders (such as, for example, psoriasis); immunoglobulin-like activity (such 
as, for example, the ability to bind antigens or complement); and the ability to act as an antigen in 
a vaccine composition to raise an immune response against such protein or another material or 

25 entity which is cross-reactive with such protein. 



4.10.19 IDENTIFICATION OF POLYMORPHISMS 

The demonstration of polymorphisms makes possible the identification of such 
polymorphisms in human subjects and the pharmacogenetic use of this information for diagnosis 
30 and treatment. Such polymorphisms may be associated with, e.g., differential predisposition or 
susceptibility to various disease states (such as disorders involving inflammation or immune 
response) or a differential response to drug administration, and this genetic information can be 
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used to tailor preventive or therapeutic treatment appropriately. For example, the existence of a 
polymorphism associated with a predisposition to inflammation or autoimmune disease makes 
possible the diagnosis of this condition in humans by identifying the presence of the 
polymorphism. 

5 Polymorphisms can be identified in a variety of ways known in the art which all generally 

involve obtaining a sample from a patient, analyzing DNA from the sample, optionally involving 
isolation or amplification of the DNA, and identifying the presence of the polymorphism in the 
DNA. For example, PCR may be used to amplify an appropriate fragment of genomic DNA 
which may then be sequenced. Alternatively, the DNA may be subjected to allele-specific 

10 oligonucleotide hybridization (in which appropriate oligonucleotides are hybridized to the DNA 
under conditions permitting detection of a single base mismatch) or to a single nucleotide 
extension assay (in which an oligonucleotide that hybridizes immediately adjacent to the position 
of the polymorphism is extended with one or more labeled nucleotides). In addition, traditional 
restriction fragment length polymorphism analysis (using restriction enzymes that provide 

15 differential digestion of the genomic DNA depending on the presence or absence of the 

polymorphism) may be performed. Arrays with nucleotide sequences of the present invention can 
be used to detect polymorphisms. The array can comprise modified nucleotide sequences of the 
present invention in order to detect the nucleotide sequences of the present invention. In the 
alternative, any one of the nucleotide sequences of the present invention can be placed on the 

20 array to detect changes from those sequences. 

Alternatively a polymorphism resulting in a change in the amino acid sequence could also 
be detected by detecting a corresponding change in amino acid sequence of the protein, e.g., by an 
antibody specific to the variant sequence. 

25 4.10.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against rheumatoid 
arthritis is determined in an experimental animal model system. The experimental model system 
is adjuvant induced arthritis in rats, and the protocol is described by J. Holoshitz, et at., 1983, 
Science, 219:56, or by B. Waksman et al, 1963, Int. Arch. Allergy Appl. Immunol., 23:129. 
30 Induction of the disease can be caused by a single injection, generally intradermally, of a 

suspension of killed Mycobacterium tuberculosis in complete Freund f s adjuvant (CFA). The route 
of injection can vary, but rats may be injected at the base of the tail with an adjuvant mixture. The 

67 



WO 02/077180 PCT/US02/08964 
polypeptide is administered in phosphate buffered solution (PBS) at a dose of about 1-5 mg/kg. 

The control consists of administering PBS only. 

The procedure for testing the effects of the test compound would consist of intradermally 

injecting killed Mycobacterium tuberculosis in CFA followed by immediately administering the 

5 test compound and subsequent treatment every other day until day 24. At 14, 15, 18, 20, 22, and 

24 days after injection of Mycobacterium CFA, an overall arthritis score may be obtained as 

described by J. Holoskitz above. An analysis of the data would reveal that the test compound 

would have a dramatic affect on the swelling of the joints as measured by a decrease of the 

arthritis score. 

10 

4.11 THERAPEUTIC METHODS 

The compositions (including polypeptide fragments, analogs, variants and antibodies or 
other binding partners or modulators including antisense polynucleotides) of the invention have 
15 numerous applications in a variety of therapeutic methods. Examples of therapeutic applications 
include, but are not limited to, those exemplified herein. 

4.11.1 EXAMPLE 

One embodiment of the invention is the administration of an effective amount of the 
20 polypeptides or other composition of the invention to individuals affected by a disease or disorder 
that can be modulated by regulating the peptides of the invention. While the mode of 
administration is not particularly important, parenteral administration is preferred. An exemplary 
mode of administration is to deliver an intravenous bolus. The dosage of the polypeptides or 
other composition of the invention will normally be determined by the prescribing physician. It is 
25 to be expected that the dosage will vary according to the age, weight, condition and response of 
the individual patient. Typically, the amount of polypeptide administered per dose will be in the 
range of about 0.0 1 ng/kg to 1 00 mg/kg of body weight, with the preferred dose being about 
O.ljig/kg to 10 mg/kg of patient body weight. For parenteral administration, polypeptides of the 
invention will be formulated in an injectable form combined with a pharmaceutically acceptable 
30 parenteral vehicle. Such vehicles are well known in the art and examples include water, saline, 
Ringer's solution, dextrose solution, and solutions consisting of small amounts of the human 
serum albumin. The vehicle may contain minor amounts of additives that maintain the isotonicity 
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and stability of the polypeptide or other active ingredient. The preparation of such solutions is 
within the skill of the art. 



5 4.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 

ADMINISTRATION 

A protein or other composition of the present invention (from whatever source derived, 
including without limitation from recombinant and non-recombinant sources and including 
antibodies and other binding partners of the polypeptides of the invention) may be administered to 

1 0 a patient in need, by itself, or in pharmaceutical compositions where it is mixed with suitable 
carriers or excipient(s) at doses to treat or ameliorate a variety of disorders. Such a composition 
may optionally contain (in addition to protein or other active ingredient and a carrier) diluents, 
fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 
"pharmaceutically acceptable" means a non-toxic material that does not interfere with the 

1 5 effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of the 
invention may also contain cytokines, lymphokines, or other hematopoietic factors such as 
M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, 
IL-13, IL-14, IL-15, IFN, TNFO, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell 

20 factor, and erythropoietin. In further compositions, proteins of the invention may be combined 
with other agents beneficial to the treatment of the disease or disorder in question. These agents 
include various growth factors such as epidermal growth factor (EGF), platelet-derived growth 
factor (PDGF), transforming growth factors (TGF-a and TGF-P), insulin-like growth factor 
(IGF), as well as cytokines described herein. 

25 The pharmaceutical composition may further contain other agents which either enhance 

the activity of the protein or other active ingredient or complement its activity or use in treatment. 
Such additional factors and/or agents may be included in the pharmaceutical composition to 
produce a synergistic effect with protein or other active ingredient of the invention, or to 
minimize side effects. Conversely, protein or other active ingredient of the present invention may 

30 be included in formulations of the particular clotting factor, cytokine, lymphokine, other 
hematopoietic factor, thrombolytic or antithrombotic factor, or anti- inflammatory agent to 
minimize side effects of the clotting factor, cytokine, lymphokine, other hematopoietic factor, 
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thrombolytic or antithrombotic factor, or anti-inflammatory agent (such as IL-IRa, IL-1 Hyl, IL- 
1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents). A protein of the present invention 
may be active in multimers (e.g., heterodimers or homodimers) or complexes with itself or other 
proteins. As a result, pharmaceutical compositions of the invention may comprise a protein of the 
5 invention in such multimeric or complexed form. 

As an alternative to being included in ^pharmaceutical composition of the invention 
including a first protein, a second protein or a therapeutic agent may be concurrently administered 
with the first protein (e.g., at the same time, or at differing times provided that therapeutic 
concentrations of the combination of agents is achieved at the treatment site). Techniques for 

10 formulation and administration of the compounds of the instant application may be found in 
"Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest edition. A 
therapeutically effective dose further refers to that amount of the compound sufficient to result in 
amelioration of symptoms, e.g., treatment, healing, prevention or amelioration of the relevant 
medical condition, or an increase in rate of treatment, healing, prevention or amelioration of such 

15 conditions. When applied to an individual active ingredient, administered alone, a therapeutically 
effective dose refers to that ingredient alone. When applied to a combination, a therapeutically 
effective dose refers to combined amounts of the active ingredients that result in the therapeutic 
effect, whether administered in combination, serially or simultaneously. 

In practicing the method of treatment or use of the present invention, a therapeutically 

20 effective amount of protein or other active ingredient of the present invention is administered to a 
mammal having a condition to be treated. Protein or other active ingredient of the present 
invention may be administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, lymphokines or other 
hematopoietic factors. When co- administered with one or more cytokines, lymphokines or other 

25 hematopoietic factors, protein or other active ingredient of the present invention may be 

administered either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic 
factor(s), thrombolytic or antithrombotic factors, or sequentially. If administered sequentially, 
the attending physician will decide on the appropriate sequence of administering protein or other 
active ingredient of the present invention in combination with cytokine(s), lymphokine(s), other 

30 hematopoietic factor(s), thrombolytic or antithrombotic factors. 



4.12.1 ROUTES OF ADMINISTRATION 
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Suitable routes of administration may, for example, include oral, rectal, transmucosal, or 

intestinal administration; parenteral delivery, including intramuscular, subcutaneous, 

intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, 

intraperitoneal, intranasal, or intraocular injections. Administration of protein or other active 

5 ingredient of the present invention used in the pharmaceutical composition or to practice the 

method of the present invention can be carried out in a variety of conventional ways, such as oral 

ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, parenteral or 

intravenous injection. Intravenous administration to the patient is preferred. 

Alternately, one may administer the compound in a local rather than systemic manner, for 

1 0 example, via injection of the compound directly into a arthritic joints or in fibrotic tissue, often in 
a depot or sustained release formulation. In order to prevent the scarring process frequently 
occurring as complication of glaucoma surgery, the compounds may be administered topically, 
for example, as eye drops. Furthermore, one may administer the drug in a targeted drug delivery 
system, for example, in a liposome coated with a specific antibody, targeting, for example, 

1 5 arthritic or fibrotic tissue. The liposomes will be targeted to and taken up selectively by the 
afflicted tissue. 

The polypeptides of the invention are administered by any route that delivers an effective 
dosage to the desired site of action. The determination of a suitable route of administration and an 
effective dosage for a particular indication is within the level of skill in the art. Preferably for 
20 wound treatment, one administers the therapeutic compound directly to the site. Suitable dosage 
ranges for the polypeptides of the invention can be extrapolated from these dosages or from 
similar studies in appropriate animal models. Dosages can then be adjusted as necessary by the 
clinician to provide maximal therapeutic benefit. 



25 4.12.2 COMPOSITIONS/FORMULATIONS 

Pharmaceutical compositions for use in accordance with the present invention thus may be 
formulated in a conventional manner using one or more physiologically acceptable carriers 
comprising excipients and auxiliaries which facilitate processing of the active compounds into 
preparations which can be used pharmaceutically. These pharmaceutical compositions may be 
30 manufactured in a manner that is itself known, e.g., by means of conventional mixing, dissolving, 
granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or lyophilizing 
processes. Proper formulation is dependent upon the route of administration chosen. When a 
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therapeutically effective amount of protein or other active ingredient of the present invention is 

administered orally, protein or other active ingredient of the present invention will be in the form 

of a tablet, capsule, powder, solution or elixir. When administered in tablet form, the 

pharmaceutical composition of the invention may additionally contain a solid carrier such as a 

5 gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to 95% protein or 

other active ingredient of the present invention, and preferably from about 25 to 90% protein or 

other active ingredient of the present invention. When administered in liquid form, a liquid 

carrier such as water, petroleum, oils of animal or plant origin such as peanut oil, mineral oil, 

soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the pharmaceutical 

10 composition may further contain physiological saline solution, dextrose or other saccharide 
solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. When 
administered in liquid form, the pharmaceutical composition contains from about 0.5 to 90% by 
weight of protein or other active ingredient of the present invention, and preferably from about 1 
to 50% protein or other active ingredient of the present invention. 

15 When a therapeutically effective amount of protein or other active ingredient of the 

present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or 
other active ingredient of the present invention will be in the form of a pyrogen-free, parenterally 
acceptable aqueous solution. The preparation of such parenterally acceptable protein or other 
active ingredient solutions, having due regard to pH, isotonicity, stability, and the like, is within 

20 the skill in the art. A preferred pharmaceutical composition for intravenous, cutaneous, or 
subcutaneous injection should contain, in addition to protein or other active ingredient of the 
present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, 
Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or other 
vehicle as known in the art. The pharmaceutical composition of the present invention may also 

25 contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of skill 
in the art. For injection, the agents of the invention may be formulated in aqueous solutions, 
preferably in physiologically compatible buffers such as Hanks's solution, Ringer's solution, or 
physiological saline buffer. For transmucosal administration, penetrants appropriate to the barrier 
to be permeated are used in the formulation. Such penetrants are generally known in the art. 

30 For oral administration, the compounds can be formulated readily by combining the active 

compounds with pharmaceutical^ acceptable carriers well known in the art. Such carriers enable 
the compounds of the invention to be formulated as tablets, pills, dragees, capsules, liquids, gels, 
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syrups, slurries, suspensions and the like, for oral ingestion by a patient to be treated. 
Pharmaceutical preparations for oral use can be obtained from a solid excipient, optionally 
grinding a resulting mixture, and processing the mixture of granules, after adding suitable 
auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in particular, 
5 fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations such 
as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, 
methyl cellulose, hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose, and/or 
polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, such as the 
cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such as sodium alginate. 

10 Dragee cores are provided with suitable coatings. For this purpose, concentrated sugar solutions 
may be used, which may optionally contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, 
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or 
solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for 
identification or to characterize different combinations of active compound doses. 

1 5 Pharmaceutical preparations which can be used orally include push-fit capsules made of 

gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 
sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as 
lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in 

20 suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 
stabilizers may be added. All formulations for oral administration should be in dosages suitable 
for such administration. For buccal administration, the compositions may take the form of tablets 
or lozenges formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present invention 

25 are conveniently delivered in the form of an aerosol spray presentation from pressurized packs or 
a nebuliser, with the use of a suitable propellant, e.g., dichlorodifluoromethane, 
trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the 
case of a pressurized aerosol the dosage unit may be determined by providing a valve to deliver a 
metered amount. Capsules and cartridges of, e.g., gelatin for use in an inhaler or insufflator may 

30 be formulated containing a powder mix of the compound and a suitable powder base such as 
lactose or starch. The compounds may be formulated for parenteral administration by injection, 
e.g., by bolus injection or continuous infusion. Formulations for injection may be presented in 
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unit dosage form, e.g., in ampules or in multi-dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous 
vehicles, and may contain formulatory agents such as suspending, stabilizing and/or dispersing 
agents. 

5 Pharmaceutical formulations for parenteral administration include aqueous solutions of the 

active compounds in water-soluble form. Additionally, suspensions of the active compounds may 
be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles 
include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or 
triglycerides, or liposomes. Aqueous injection suspensions may contain substances which 

10 increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 

dextran. Optionally, the suspension may also contain suitable stabilizers or agents which increase 
the solubility of the compounds to allow for the preparation of highly concentrated solutions. 
Alternatively, the active ingredient may be in powder form for constitution with a suitable 
vehicle, e.g., sterile pyrogen-free water, before use. 

1 5 The compounds may also be formulated in rectal compositions such as suppositories or 

retention enemas, eg., containing conventional suppository bases such as cocoa butter or other 
glycerides. In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 

20 Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as sparingly 
soluble derivatives, for example, as a sparingly soluble salt. 

A pharmaceutical carrier for the hydrophobic compounds of the invention is a co-solvent 
system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, and 

25 an aqueous phase. The co-solvent system may be the VPD co-solvent system. VPD is a solution 
of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, and 65% w/v 
polyethylene glycol 300, made up to volume in absolute ethanol. The VPD co-solvent system 
(VPD:5W) consists of VPD diluted 1:1 with a 5% dextrose in water solution. This co-solvent 
system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic 

30 administration. Naturally, the proportions of a co-solvent system may be varied considerably 
without destroying its solubility and toxicity characteristics. Furthermore, the identity of the 
co-solvent components may be varied: for example, other low-toxicity nonpolar surfactants may 
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be used instead of polysorbate 80; the fraction size of polyethylene glycol may be varied; other 
biocompatible polymers may replace polyethylene glycol, e.g. polyvinyl pyrrolidone; and other 
sugars or polysaccharides may substitute for dextrose. Alternatively, other delivery systems for 
hydrophobic pharmaceutical compounds may be employed. Liposomes and emulsions are well 
5 known examples of delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents 
such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. 
Additionally, the compounds may be delivered using a sustained-release system, such as 
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. Various 
types of sustained-release materials have been established and are well known by those skilled in 

1 0 the art. Sustained-release capsules may, depending on their chemical nature, release the 

compounds for a few weeks up to over 100 days. Depending on the chemical nature and the 
biological stability of the therapeutic reagent, additional strategies for protein or other active 
ingredient stabilization may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase carriers or 

15 excipients. Examples of such carriers or excipients include but are not limited to calcium 
carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and 
polymers such as polyethylene glycols. Many of the active ingredients of the invention may be 
provided as salts with phannaceutically compatible counter ions. Such pharmaceutically 
acceptable base addition salts are those salts which retain the biological effectiveness and 

20 properties of the free acids and which are obtained by reaction with inorganic or organic bases 
such as sodium hydroxide, magnesium hydroxide, ammonia, trialkylamine, dialkylamine, 
monoalkylamine, dibasic amino acids, sodium acetate, potassium benzoate, triethanol amine and 
the like. 

The pharmaceutical composition of the invention may be in the form of a complex of the 
25 protein(s) or other active ingredient(s) of present invention along with protein or peptide antigens. 
The protein and/or peptide antigen will deliver a stimulatory signal to both B and T lymphocytes. 
B lymphocytes will respond to antigen through their surface immunoglobulin receptor. T 
lymphocytes will respond to antigen through the T cell receptor (TCR) following presentation of 
the antigen by MHC proteins. MHC and structurally related proteins including those encoded by 
30 class I and class II MHC genes on host cells will serve to present the peptide antigen(s) to T 
lymphocytes. The antigen components could also be supplied as purified MHC-peptide 
complexes alone or with co-stimulatory molecules that can directly signal T cells. Alternatively 
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antibodies able to bind surface immunoglobulin and other molecules on B cells as well as 
antibodies able to bind the TCR and other molecules on T cells can be combined with the 
pharmaceutical composition of the invention. 

The pharmaceutical composition of the invention may be in the form of a liposome in 
5 which protein of the present invention is combined, in addition to other pharmaceutically 
acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as 
micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. Suitable 
lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides, 
sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like. Preparation of such 
10 liposomal formulations is within the level of skill in the art, as disclosed, for example, in U.S. 

Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of which are incorporated herein 
by reference. 

The amount of protein or other active ingredient of the present invention in the 
pharmaceutical composition of the present invention will depend upon the nature and severity of 

1 5 the condition being treated, and on the nature of prior treatments which the patient has undergone. 
Ultimately, the attending physician will decide the amount of protein or other active ingredient of 
the present invention with which to treat each individual patient. Initially, the attending physician 
will administer low doses of protein or other active ingredient of the present invention and 
observe the patient's response. Larger doses of protein or other active ingredient of the present 

20 invention may be administered until the optimal therapeutic effect is obtained for the patient, and 
at that point the dosage is not increased further. It is contemplated that the various pharmaceutical 
compositions used to practice the method of the present invention should contain about 0.01 (ag to 
about 100 mg (preferably about 0.1 ng to about 10 mg, more preferably about 0.1 \ig to about 1 
mg) of protein or other active ingredient of the present invention per kg body weight. For 

25 * compositions of the present invention which are useful for bone, cartilage, tendon or ligament 
regeneration, the therapeutic method includes administering the composition topically, 
systematically, or locally as an implant or device. When administered, the therapeutic 
composition for use in this invention is, of course, in a pyrogen-free, physiologically acceptable 
form. Further, the composition may desirably be encapsulated or injected in a viscous form for 

30 delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable for 
wound healing and tissue repair. Therapeutically useful agents other than a protein or other active 
ingredient of the invention which may also optionally be included in the composition as described 
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above, may alternatively or additionally, be administered simultaneously or sequentially with the 

composition in the methods of the invention. Preferably for bone and/or cartilage formation, the 

composition would include a matrix capable of delivering the protein-containing or other active 

ingredient-containing composition to the site of bone and/or cartilage damage, providing a 

5 structure for the developing bone and cartilage and optimally capable of being resorbed into the 

body. Such matrices may be formed of materials presently in use for other implanted medical 

applications. 

The choice of matrix material is based on biocompatibility, biodegradability, mechanical 
properties, cosmetic appearance and interface properties. The particular application of the 

10 compositions will define the appropriate formulation. Potential matrices for the compositions 
may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate, 
hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides. Other potential materials 
are biodegradable and biologically well-defined, such as bone or dermal collagen. Further 
matrices are comprised of pure proteins or extracellular matrix components. Other potential 

1 5 matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, 
aluminates, or other ceramics. Matrices may be comprised of combinations of any of the above 
mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and tricalcium 
phosphate. The bioceramics may be altered in composition, such as in 
calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 

20 biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and 

glycolic acid in the form of porous particles having diameters ranging from 150 to 800 microns. 
In some applications, it will be useful to utilize a sequestering agent, such as carboxymethyl 
cellulose or autologous blood clot, to prevent the protein compositions from disassociating from 
the matrix. 

25 A preferred family of sequestering agents is cellulosic materials such as alkylcelluloses 

(including hydroxyalkylcelluloses), including methylcellulose, ethylcellulose, 
hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl-methylcellulose, and 
carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose 
(CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate, 

30 polyethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and polyvinyl alcohol). 
The amount of sequestering agent useful herein is 0.5-20 wt %, preferably 1-10 wt % based on 
total formulation weight, which represents the amount necessary to prevent desorption of the 
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protein from the polymer matrix and to provide appropriate handling of the composition, yet not 

so much that the progenitor cells are prevented from infiltrating the matrix, thereby providing the 

protein the opportunity to assist the osteogenic activity of the progenitor cells. In further 

compositions, proteins or other active ingredients of the invention may be combined with other 

5 agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in question. 

These agents include various growth factors such as epidermal growth factor (EGF), platelet 

derived growth factor (PDGF), transforming growth factors (TGF-ot and TGF-P), and insulin-like 

growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary applications. 

10 Particularly domestic animals and thoroughbred horses, in addition to humans, are desired 

patients for such treatment with proteins or other active ingredients of the present invention. The 
dosage regimen of a protein-containing pharmaceutical composition to be used in tissue 
regeneration will be determined by the attending physician considering various factors which 
modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site of 

15 damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (e.g., 
bone), the patient's age, sex, and diet, the severity of any infection, time of administration and 
other clinical factors. The dosage may vary with the type of matrix used in the reconstitution and 
with inclusion of other proteins in the pharmaceutical composition. For example, the addition of 
other known growth factors, such as IGF I (insulin like growth factor I), to the final composition, 

20 may also effect the dosage. Progress can be monitored by periodic assessment of tissue/bone 
growth and/or repair, for example, X-rays, histomorphometric determinations and tetracycline 
labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
25 mammalian subject. Polynucleotides of the invention may also be administered by other known 
methods for introduction of nucleic acid into a cell or organism (including, without limitation, in 
the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or activity 
in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 

30 

4.12.3 EFFECTIVE DOSAGE 
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Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve its 
intended purpose. More specifically, a therapeutically effective amount means an amount 
effective to prevent development of or to alleviate the existing symptoms of the subject being 
5 treated. Determination of the effective amount is well within the capability of those skilled in the 
art, especially in light of the detailed disclosure provided herein. For any compound used in the 
method of the invention, the therapeutically effective dose can be estimated initially from 
appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a 
circulating concentration range that can be used to more accurately determine useful doses in 

10 humans. For example, a dose can be formulated in animal models to achieve a circulating 

concentration range that includes the IC 5 o as determined in cell culture (i.e., the concentration of 
the test compound which achieves a half-maximal inhibition of the protein's biological activity). 
Such information can be used to more accurately determine useful doses in humans. 

A therapeutically effective dose refers to that amount of the compound that results in 

1 5 amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic 
efficacy of such compounds can be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, e.g., for determining the LD 5 o (the dose lethal to 50% of the 
population) and the ED 50 (the dose therapeutically effective in 50% of the population). The dose 
ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the 

20 ratio between LD 5 o and ED 5 o. Compounds which exhibit high therapeutic indices are preferred. 
The data obtained from these cell culture assays and animal studies can be used in formulating a 
range of dosage for use in human. The dosage of such compounds lies preferably within a range 
of circulating concentrations that include the ED50 with little or no toxicity. The dosage may vary 
within this range depending upon the dosage form employed and the route of administration 

25 utilized. The exact formulation, route of administration and dosage can be chosen by the 
individual physician in view of the patient's condition. See, e.g., Fingl et al., 1975, in "The 
Pharmacological Basis of Therapeutics", Ch. 1 p.l. Dosage amount and interval maybe adjusted 
individually to provide plasma levels of the active moiety which are sufficient to maintain the 
desired effects, or minimal effective concentration (MEC). The MEC will vary for each 

30 compound but can be estimated from in vitro data. Dosages necessary to achieve the MEC will 
depend on individual characteristics and route of administration. However, HPLC assays or 
bioassays can be used to determine plasma concentrations. 
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Dosage intervals can also be determined using MEC value. Compounds should be 
administered using a regimen which maintains plasma levels above the MEC for 10-90% of the 
time, preferably between 30-90% and most preferably between 50-90%. In cases of local 
administration or selective uptake, the effective local concentration of the drug may not be related 
5 to plasma concentration. 

An exemplary dosage regimen for polypeptides or other compositions of the invention will 
be in the range of about 0.01 ^ig/kg to 100 mg/kg of body weight daily, with the preferred dose 
being about 0. 1 (ig/kg to 25 mg/kg of patient body weight daily, varying in adults and children. 
Dosing may be once daily, or equivalent doses may be delivered at longer or shorter intervals. 
10 The amount of composition administered will, of course, be dependent on the subject 

being treated, on the subject's age and weight, the severity of the affliction, the manner of 
administration and the judgment of the prescribing physician. 

4,12.4 PACKAGING 

1 5 The compositions may, if desired, be presented in a pack or dispenser device which may 

contain one or more unit dosage forms containing the active ingredient. The pack may, for 
example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may 
be accompanied by instructions for administration. Compositions comprising a compound of the 
invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an 

20 appropriate container, and labeled for treatment of an indicated condition. 

4.13 ANTIBODIES 

Also included in the invention are antibodies to proteins, or fragments of proteins of the 
invention. The term "antibody" as used herein refers to immunoglobulin molecules and 

25 immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that contain 
an antigen-binding site that specifically binds (immunoreacts with) an antigen. Such antibodies 
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, F ab , F a f and F( a t>')2 
fragments, and an F a b expression library. In general, an antibody molecule obtained from humans 
relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ from one another by the 

30 nature of the heavy chain present in the molecule. Certain classes have subclasses as well, such as 
IgGi, IgG2, and others. Furthermore, in humans, the light chain may be a kappa chain or a lambda 
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chain. Reference herein to antibodies includes a reference to all such classes, subclasses and 

types of human antibody species. 

An isolated related protein of the invention may be intended to serve as an antigen, or a 

portion or fragment thereof, and additionally can be used as an immunogen to generate antibodies 

5 that immunospecifically bind the antigen, using standard techniques for polyclonal and 

monoclonal antibody preparation. The full-length protein can be used or, alternatively, the 

invention provides antigenic peptide fragments of the antigen for use as immunogens. An 

antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid sequence of 

the full length protein, such as an amino acid sequence shown in SEQ ID NO: 1-11, and 

10 encompasses an epitope thereof such that an antibody raised against the peptide forms a specific 
immune complex with the full length protein or with any fragment that contains the epitope. 
Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least 15 amino 
acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. Preferred 
epitopes encompassed by the antigenic peptide are regions of the protein that are located on its 

1 5 surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a surface region of the protein, e.g., a hydrophilic region. A hydrophobicity 
analysis of the human related protein sequence will indicate which regions of a related protein are 
particularly hydrophilic and, therefore, are likely to encode surface residues useful for targeting 

20 antibody production. As a means for targeting antibody production, hydropathy plots showing 
regions of hydrophilicity and hydrophobicity may be generated by any method well known in the 
art, including, for example, the Kyte Doolittle or the Hopp Woods methods, either with or without 
Fourier transformation. See, e.g., Hopp and Woods, 1981, Proc. Nat Acad. Sci. USA 78: 3824- 
3828; Kyte and Doolittle 1982, J. Mol Biol 157: 105-142, each of which is incorporated herein 

25 by reference in its entirety. Antibodies that are specific for one or more domains within an 
antigenic protein, or derivatives, fragments, analogs or homologs thereof, are also provided 
herein. 

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog thereof, 
may be utilized as an immunogen in the generation of antibodies that immunospecifically bind 
30 these protein components. 

The term "specific for" indicates that the variable regions of the antibodies of the 
invention recognize and bind polypeptides of the invention exclusively {i.e., able to distinguish 
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the polypeptide of the invention from other similar polypeptides despite sequence identity, 
homology, or similarity found in the family of polypeptides), but may also interact with other 
proteins (for example, S. aureus protein A or other antibodies in ELIS A techniques) through 
interactions with sequences outside the variable region of the antibodies, and in particular, in the 
5 constant region of the molecule. Screening assays to determine binding specificity of an antibody 
of the invention are well known and routinely practiced in the art. For a comprehensive 
discussion of such assays, see Harlow et al. (Eds), Antibodies A Laboratory Manual; Cold Spring 
Harbor Laboratory; Cold Spring Harbor, NY (1988), Chapter 6. Antibodies that recognize and 
bind fragments of the polypeptides of the invention are also contemplated, provided that the 

10 antibodies are first and foremost specific for, as defined above, full-length polypeptides of the 
invention. As with antibodies that are specific for full length polypeptides of the invention, 
antibodies of the invention that recognize fragments are those which can distinguish polypeptides 
from the same family of polypeptides despite inherent sequence identity, homology, or similarity 
found in the family of proteins. 

15 Antibodies of the invention are useful for, for example, therapeutic purposes (by 

modulating activity of a polypeptide of the invention), diagnostic purposes to detect or quantitate 
a polypeptide of the invention, as well as purification of a polypeptide of the invention. Kits 
comprising an antibody of the invention for any of the purposes described herein are also 
comprehended. In general, a kit of the invention also includes a control antigen for which the 

20 antibody is immunospecific. The invention further provides a hybridoma that produces an 

antibody according to the invention. Antibodies of the invention are useful for detection and/or 
purification of the polypeptides of the invention. 

Monoclonal antibodies binding to the protein of the invention may be useful diagnostic 
agents for the immunodetection of the protein. Neutralizing monoclonal antibodies binding to the 

25 protein may also be useful therapeutics for both conditions associated with the protein and also in 
the treatment of some forms of cancer where abnormal expression of the protein is involved. In 
the case of cancerous cells or leukemic cells, neutralizing monoclonal antibodies against the 
protein may be useful in detecting and preventing the metastatic spread of the cancerous cells, 
which may be mediated by the protein. 

30 The labeled antibodies of the present invention can be used for in vitro, in vivo, and in situ 

assays to identify cells or tissues in which a fragment of the polypeptide of interest is expressed. 
The antibodies may also be used directly in therapies or other diagnostics. The present invention 
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further provides the above-described antibodies immobilized on a solid support. Examples of 
such solid supports include plastics such as polycarbonate, complex carbohydrates such as 
agarose and Sepharose®, acrylic resins and such as polyacrylamide and latex beads. Techniques 
for coupling antibodies to such solid supports are well known in the art (W eir, D.M. et al., 
5 "Handbook of Experimental Immunology" 4th Ed., Blackwell Scientific Publications, Oxford, 
England, Chapter 10 (1986); Jacoby, W.D. et al., Meth. Enzym. 34 Academic Press, N.Y. (1974)). 
The immobilized antibodies of the present invention can be used for in vitro, in vivo, and in situ 
assays as well as for immuno-affinity purification of the proteins of the present invention. 

Various procedures known within the art may be used for the production of polyclonal or 
1 0 monoclonal antibodies directed against a protein of the invention, or against derivatives, 

fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: A Laboratory 
Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY, incorporated herein by reference). Some of these antibodies are discussed below. 



1 5 4.13-1 POLYCLONAL ANTIBODIES 

For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit, 
goat, mouse or other mammal) may be immunized by one or more injections with the native 
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate immunogenic 
preparation can contain, for example, the naturally occurring immunogenic protein, a chemically 

20 synthesized polypeptide representing the immunogenic protein, or a recombinantly expressed 
immunogenic protein. Furthermore, the protein may be conjugated to a second protein known to 
be immunogenic in the mammal being immunized. Examples of such immunogenic proteins 
include but are not limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, 
and soybean trypsin inhibitor. The preparation can further include an adjuvant. Various adjuvants 

25 used to increase the immunological response include, but are not limited to, Freund ! s (complete 
and incomplete), mineral gels (e.g., aluminum hydroxide), surface-active substances (e.g., 
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, etc.), adjuvants 
usable in humans such as Bacille Calmette-Guerin and Corynebacterium parvum, or similar 
immunostimulatory agents. Additional examples of adjuvants that can be employed include 

30 MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 

The polyclonal antibody molecules directed against the immunogenic protein can be 
isolated from the mammal (e.g., from the blood) and further purified by well known techniques, 

83 
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such as affinity chromatography using protein A or protein G, which provide primarily the IgG 
fraction of immune serum. Subsequently, or alternatively, the specific antigen which is the target 
of the immunoglobulin sought, or an epitope thereof, may be immobilized on a column to purify 
the immune specific antibody by immunoaffinity chromatography. Purification of 
5 immunoglobulins is discussed, for example, by D. Wilkinson (The Scientist, published by The 
Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 2000), pp. 25-28). 

4.13.2 MONOCLONAL ANTIBODIES 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as used 

10 herein, refers to a population of antibody molecules that contain only one molecular species of 
antibody molecule consisting of a unique light chain gene product and a unique heavy chain gene 
product. In particular, the complementarity determining regions (CDRs) of the monoclonal 
antibody are identical in all the molecules of the population. MAbs thus contain an antigen- 
binding site capable of immunoreacting with a particular epitope of the antigen characterized by a 

1 5 unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those described 
by Kohler and Milstein, Nature . 256:495 (1975). In a hybridoma method, a mouse, hamster, or 
other appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to the 

20 immunizing agent. Alternatively, the lymphocytes can be immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof or a 
fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of human 
origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources 
are desired. The lymphocytes are then fused with an immortalized cell line using a suitable fusing 

25 agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: 
Principles and Practice, Academic Press, (1986) pp. 59-103). Immortalized cell lines are usually 
transformed mammalian cells, particularly myeloma cells of rodent, bovine and human origin. 
Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells can be cultured in a 
suitable culture medium that preferably contains one or more substances that inhibit the growth or 

30 survival of the unfused, immortalized cells. For example, if the parental cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the 
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hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), 
which substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level 
expression of antibody by the selected antibody-producing cells, and are sensitive to a medium 
5 such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which 
can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, 
California and the American Type Culture Collection, Manassas, Virginia. Human myeloma and 
mouse-human heteromyeloma cell lines also have been described for the production of human 
monoclonal antibodies (Kozbor, J. Immunol.. 133:3001 (1984); Brodeur et al., Monoclonal 
10 Antibody Production Techniques and Applications. Marcel Dekker, Inc., New York, (1987) pp. 
51-63). 

The culture medium in which the hybridoma cells are cultured can then be assayed for the 
presence of monoclonal antibodies directed against the antigen. Preferably, the binding 
specificity of monoclonal antibodies produced by the hybridoma cells is determined by 

1 5 immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 

enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the 
art. The binding affinity of the monoclonal antibody can, for example, be determined by the 
Scatchard analysis of Munson and Pollard, Anal. Biochem.. 107:220 (1980). Preferably, 
antibodies having a high degree of specificity and a high binding affinity for the target antigen are 

20 isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by limiting 
dilution procedures and grown by standard methods. Suitable culture media for this purpose 
include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. 
Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

25 The monoclonal antibodies secreted by the subclones can be isolated or purified from the 

culture medium or ascites fluid by conventional immunoglobulin purification procedures such as, 
for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, 
or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as those 

30 described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the 

invention can be readily isolated and sequenced using conventional procedures (e.g., by using 
oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
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light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred 
source of such DNA. Once isolated, the DNA can be placed into expression vectors, which are 
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or 
myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of 
5 monoclonal antibodies in the recombinant host cells. The DNA also can be modified, for 

example, by substituting the coding sequence for human heavy and light chain constant domains 
in place of the homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368, 
812-13 (1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the 
coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 
10 polypeptide can be substituted for the constant domains of an antibody of the invention, or can be 
substituted for the variable domains of one antigen-combining site of an antibody of the invention 
to create a chimeric bivalent antibody. 



4.13,3 HUMANIZED ANTIBODIES 

1 5 The antibodies directed against the protein antigens of the invention can further comprise 

humanized antibodies or human antibodies. These antibodies are suitable for administration to 
humans without engendering an immune response by the human against the administered 
immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab 1 , F(ab') 2 or other antigen- 

20 binding subsequences of antibodies) that are principally comprised of the sequence of a human 
immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin. 
Humanization can be performed following the method of Winter and co-workers (Jones et al., 
Nature . 321 :522-525 (1986); Riechmann et al., Nature. 332:323-327 (1988); Verhoeyen et al, 
Science. 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 

25 corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539). In some 
instances, Fv framework residues of the human immunoglobulin are replaced by corresponding 
non-human residues. Humanized antibodies can also comprise residues that are found neither in 
the recipient antibody nor in the imported CDR or framework sequences. In general, the 
humanized antibody will comprise substantially all of at least one, and typically two, variable 

30 domains, in which all or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the framework regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at 
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least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et ah, 1986; Riechmann et al., 1988; and Presta, Curr. On. Str uct. Biol. 
2:593-596(1992)). 



5 4.13.4 HUMAN ANTIBODIES 

Fully human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from human 
genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein. 
Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell 

10 hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma 
technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human monoclonal antibodies 
may be utilized in the practice of the present invention and may be produced by using human 
hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or by transforming 

15 human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1985 In: Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, including 
phage display libraries (Hoogenboom and Winter, J. Mol. Biol. 227:381 (1991); Marks et al, I 
Mol. Biol.. 222:581 (1991)). Similarly, human antibodies can be made by introducing human 

20 immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous 

immunoglobulin genes have been partially or completely inactivated. Upon challenge, human 
antibody production is observed, which closely resembles that seen in humans in all respects, 
including gene rearrangement, assembly, and antibody repertoire. This approach is described, for 
example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, 

25 and in Marks et al. (Bio/Technology 10, 779-783 (1992)); Lonberg et al. fNature 368 856-859 
(1994)); Morrison (Nature 368, 812-13 (1994)); Fishwild et al, fNature Biotechnology 14, 845-51 
(1996)); Neuberger fNature Biotechnology 14, 826 (1996)); and Lonberg and Huszar flntern. Rev. 
Immunol. 13 65-93 (1995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals that 

30 are modified so as to produce fully human antibodies rather than the animal's endogenous 
antibodies in response to challenge by an antigen. (See PCT publication WO94/02602). The 
endogenous genes encoding the heavy and light immunoglobulin chains in the nonhuman host 
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have been incapacitated, and active loci encoding human heavy and light chain immunoglobulins 
are inserted into the host's genome. The human genes are incorporated, for example, using yeast 
artificial chromosomes containing the requisite human DNA segments. An animal which 
provides all the desired modifications is then obtained as progeny by crossbreeding intermediate 
5 transgenic animals containing fewer than the full complement of the modifications. The preferred 
embodiment of such a nonhuman animal is a mouse, and is termed the Xenomouse™ as disclosed 
in PCT publications WO 96/33735 and WO 96/34096. This animal produces B cells that secrete 
fully human immunoglobulins. The antibodies can be obtained directly from the animal after 
immunization with an immunogen of interest, as, for example, a preparation of a polyclonal 

10 antibody, or alternatively from immortalized B cells derived from the animal, such as hybridomas 
producing monoclonal antibodies. Additionally, the genes encoding the immunoglobulins with 
human variable regions can be recovered and expressed to obtain the antibodies directly, or can be 
further modified to obtain analogs of antibodies such as, for example, single chain Fv molecules. 
An example of a method of producing a nonhuman host, exemplified as a mouse, lacking 

15 expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent No. 

5,939,598. It can be obtained by a method including deleting the J segment genes from at least 
one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of the 
locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy chain locus, 
the deletion being effected by a targeting vector containing a gene encoding a selectable marker; 

20 and producing from the embryonic stem cell a transgenic mouse whose somatic and germ cells 
contain the gene encoding the selectable marker. 

A method for producing an antibody of interest, such as a human antibody, is disclosed in 
U.S. Patent No. 5,916,771. It includes introducing an expression vector that contains a nucleotide 
sequence encoding a heavy chain into one mammalian host cell in culture, introducing an 

25 expression vector containing a nucleotide sequence encoding a light chain into another 

mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell expresses an 
antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically relevant 
epitope on an immunogen, and a correlative method for selecting an antibody that binds 

30 immunospecifically to the relevant epitope with high affinity, are disclosed in PCT publication 
WO 99/53049. 
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4.13.5 FAB FRAGMENTS AND SINGLE CHAIN ANTIBODIES 

According to the invention, techniques can be adapted for the production of single-chain 

antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent No. 4,946,778). 

In addition, methods can be adapted for the construction of F a b expression libraries (see e.g., Huse, 

5 et al., 1989 Science 246: 1275-1281) to allow rapid and effective identification of monoclonal F a b 

fragments with the desired specificity for a protein or derivatives, fragments, analogs or homologs 

thereof. Antibody fragments that contain the idiotypes to a protein antigen may be produced by 

techniques known in the art including, but not limited to: (i) an F (a b*)2 fragment produced by 

pepsin digestion of an antibody molecule; (ii) an F ab fragment generated by reducing the disulfide 

10 bridges of an F (a b')2 fragment; (iii) an F ab fragment generated by the treatment of the antibody 

molecule with papain and a reducing agent and (iv) F v fragments. 



4.13.6 BISPECIFIC ANTIBODIES 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 

15 have binding specificities for at least two different antigens. In the present case, one of the 

binding specificities is for an antigenic protein of the invention. The second binding target is any 
other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 

20 immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature . 305:537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a 
potential mixture often different antibody molecules, of which only one has the correct bispecific 
structure. The purification of the correct molecule is usually accomplished by affinity 

25 chromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 
1993, and in Traunecker et al 9 1991 EMBOJ., 10:3655-3659. 

Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of 

30 the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region 
(CHI) containing the site necessary for light-chain binding present in at least one of the fusions. 
DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin 
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light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 
host organism. For further details of generating bispecific antibodies see, for example, Suresh et 
al, Methods in Enzvmology, 121:210 (1986). 

According to another approach described in WO 96/2701 1, the interface between a pair of 
5 antibody molecules can be engineered to maximize the percentage of heterodimers that are 

recovered from recombinant cell culture. The preferred interface comprises at least a part of the 
CH3 region of an antibody constant domain. In this method, one or more small amino acid side 
chains from the interface of the first antibody molecule are replaced with larger side chains (e.g. 
tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side 

1 0 chain(s) are created on the interface of the second antibody molecule by replacing large amino 
acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for 
increasing the yield of the heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. 
F(ab') 2 bispecific antibodies). Techniques for generating bispecific antibodies from antibody 

1 5 fragments have been described in the literature. For example, bispecific antibodies can be 
prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a procedure 
wherein intact antibodies are proteolytically cleaved to generate F(ab') 2 fragments. These 
fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to stabilize 
vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments generated are 

20 then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB derivatives is then 
reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is mixed with an 
equimolar amount of the other Fab'-TNB derivative to form the bispecific antibody. The 
bispecific antibodies produced can be used as agents for the selective immobilization of enzymes. 
Additionally, Fab' fragments can be directly recovered from E. coli and chemically 

25 coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe 
the production of a fully humanized bispecific antibody F(ab')2 molecule. Each Fab' fragment 
was separately secreted from E. coli and subjected to directed chemical coupling in vitro to form 
the bispecific antibody. The bispecific antibody thus formed was able to bind to cells 
overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity 

30 of human cytotoxic lymphocytes against human breast tumor targets. 

Various techniques for making and isolating bispecific antibody fragments directly from 
recombinant cell culture have also been described. For example, bispecific antibodies have been 
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produced using leucine zippers. Kostelny et al., J. Immunol 148(5): 1547-1553 (1992). The 
leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two 
different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region 
to form monomers and then re-oxidized to form the antibody heterodimers. This method can also 
5 be utilized for the production of antibody homodimers. The "diabody" technology described by 
Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an alternative 
mechanism for making bispecific antibody fragments. The fragments comprise a heavy-chain 
variable domain (V H ) connected to a light-chain variable domain (V L ) by a linker which is too 
short to allow pairing between the two domains on the same chain. Accordingly, the V H and V L 

10 domains of one fragment are forced to pair with the complementary V L and V H domains of 
another fragment, thereby forming two antigen-binding sites. Another strategy for making 
bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been reported. 
See, Gruber et al., J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 

15 antibodies can be prepared. Tutt et al, J. Immunol. 147:60 (1991). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of which 
originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm of an 
immunoglobulin molecule can be combined with an arm which binds to a triggering molecule on 
a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc receptors for 

20 IgG (FcyR), such as FcyRI (CD64), FcyRII (CD32) and FcyRIII (CD16) so as to focus cellular 
defense mechanisms to the cell expressing the particular antigen. Bispecific antibodies can also 
be used to direct cytotoxic agents to cells which express a particular antigen. These antibodies 
possess an antigen-binding arm and an arm which binds a cytotoxic agent or a radionuclide 
chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody of interest 

25 binds the protein antigen described herein and further binds tissue factor (TF). 



4.13.7 HETEROCONJUGATE ANTIBODIES 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies 
30 have, for example, been proposed to target immune system cells to unwanted cells (U.S. Patent 
No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 92/200373; EP 03089). 
It is contemplated that the antibodies can be prepared in vitro using known methods in synthetic 
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protein chemistry, including those involving crosslinking agents. For example, immunotoxins can 

be constructed using a disulfide exchange reaction or by forming a thioether bond. Examples of 

suitable reagents for this purpose include iminothiolate and methyl-4-mercaptobutyrimidate and 

those disclosed, for example, in U.S. Patent No. 4,676,980. 

5 

4.13.8 EFFECTOR FUNCTION ENGINEERING 

It can be desirable to modify the antibody of the invention with respect to effector 
function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For example, 
cysteine residue(s) can be introduced into the Fc region, thereby allowing interchain disulfide 

10 bond formation in this region. The homodimeric antibody thus generated can have improved 
internalization capability and/or increased complement-mediated cell killing and antibody- 
dependent cellular cytotoxicity (ADCC). See Caron et aL, J. Exp Med., 176: 1 191-1 195 (1992) 
and Shopes, J. Immunol., 148: 2918-2922 (1992). Homodimeric antibodies with enhanced anti- 
tumor activity can also be prepared using heterobifunctional cross-linkers as described in Wolff 

15 et al. Cancer Research, 53; 2560-2565 (1993). Alternatively, an antibody can be engineered that 
has dual Fc regions and can thereby have enhanced complement lysis and ADCC capabilities. 
See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989). 



4.13.9 IMMUNOCONJUGATES 

20 The invention also pertains to immunoconjugates comprising an antibody conjugated to a 

cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of 
bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., a 
radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been 
25 described above. Enzymatically active toxins and fragments thereof that can be used include 
diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from 
Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites 
fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and PAP-S), 
momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, mitogellin, 
30 restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of radionuclides are 

available for the production of radioconjugated antibodies. Examples include 212 Bi, 13, 1, 131 In, 
90 Y, and 186 Re. 
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Conjugates of the antibody and cytotoxic agent are made using a variety of bifiinctional 

protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), 

iminothiolane (IT), bifiinctional derivatives of imidoesters (such as dimethyl adipimidate HCL), 

active esters (such as disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido 

5 compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as bis- 

(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), and 

bis-active fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). For example, a ricin 

immunotoxin can be prepared as described in Vitetta et al., Science, 238: 1098 (1987). Carbon- 

14-labeled l-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is 

10 an exemplary chelating agent for conjugation of radionucleotide to the antibody. See 

WO94/11026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such streptavidin) 
for utilization in tumor pretargeting wherein the antibody-receptor conjugate is administered to 
the patient, followed by removal of unbound conjugate from the circulation using a clearing agent 
15 and then administration of a "ligand" (e.g., avidin) that is in turn conjugated to a cytotoxic agent. 



4.14 COMPUTER READABLE SEQUENCES 

In one application of this embodiment, a nucleotide sequence of the present invention can 
20 be recorded on computer readable media. As used herein, "computer readable media" refers to 
any medium which can be read and accessed directly by a computer. Such media include, but are 
not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and 
magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM 
and ROM; and hybrids of these categories such as magnetic/optical storage media. A skilled 
25 artisan can readily appreciate how any of the presently known computer readable mediums can be 
used to create a manufacture comprising computer readable medium having recorded thereon a 
nucleotide sequence of the present invention. As used herein, "recorded" refers to a process for 
storing information on computer readable medium. A skilled artisan can readily adopt any of the 
presently known methods for recording information on computer readable medium to generate 
30 manufactures comprising the nucleotide sequence information of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating a 
computer readable medium having recorded thereon a nucleotide sequence of the present 
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invention. The choice of the data storage structure will generally be based on the means chosen to 
access the stored information. In addition, a variety of data processor programs and formats can 
be used to store the nucleotide sequence information of the present invention on computer 
readable medium. The sequence information can be represented in a word processing text file, 
5 formatted in commercially-available software such as WordPerfect and Microsoft Word, or 
represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, 
Oracle, or the like. A skilled artisan can readily adapt any number of data processor structuring 
formats (e.g. text file or database) in order to obtain computer readable medium having recorded 
thereon the nucleotide sequence information of the present invention. 

1 0 By providing any of the nucleotide sequences SEQ ID NOs: 1 - 1 1 or a representative 

fragment thereof; or a nucleotide sequence at least 95% identical to any of the nucleotide 
sequences of SEQ ID NOs: 1 - 1 1 in computer readable form, a skilled artisan can routinely 
access the sequence information for a variety of purposes. Computer software is publicly 
available which allows a skilled artisan to access sequence information provided in a computer 

15 readable medium. The examples which follow demonstrate how software which implements the 
BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. 
Chem. 17:203-207 (1993)) search algorithms on a Sybase system is used to identify open reading 
frames (ORFs) within a nucleic acid sequence. Such ORFs may be protein encoding fragments 
and may be useful in producing commercially important proteins such as enzymes used in 

20 fermentation reactions and in the production of commercially useful metabolites. 

As used herein, "a computer-based system" refers to the hardware means, software means, 
and data storage means used to analyze the nucleotide sequence information of the present 
invention. The minimum hardware means of the computer-based systems of the present invention 
comprises a central processing unit (CPU), input means, output means, and data storage means. A 

25 skilled artisan can readily appreciate that any one of the currently available computer-based 

systems are suitable for use in the present invention. As stated above, the computer-based systems 
of the present invention comprise a data storage means having stored therein a nucleotide 
sequence of the present invention and the necessary hardware means and software means for 
supporting and implementing a search means. As used herein, "data storage means" refers to 

30 memory which can store nucleotide sequence information of the present invention, or a memory 
access means which can access manufactures having recorded thereon the nucleotide sequence 
information of the present invention. 
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As used herein, "search means" refers to one or more programs which are implemented on 
the computer-based system to compare a target sequence or target structural motif with the 
sequence information stored within the data storage means. Search means are used to identify 
fragments or regions of a known sequence which match a particular target sequence or target 
5 motif. A variety of known algorithms are disclosed publicly and a variety of commercially 
available software for conducting search means are and can be used in the computer-based 
systems of the present invention. Examples of such software include, but are not limited to, 
Smith- Waterman, MacPattern (EMBL), BLASTN and BLASTA (NPOLYPEPTIDEIA). A 
skilled artisan can readily recognize that any one of the available algorithms or implementing 

1 0 software packages for conducting homology searches can be adapted for use in the present 

computer-based systems. As used herein, a "target sequence" can be any nucleic acid or amino 
acid sequence of six or more nucleotides or two or more amino acids. A skilled artisan can 
readily recognize that the longer a target sequence is, the less likely a target sequence will be 
present as a random occurrence in the database. The most preferred sequence length of a target 

15 sequence is from about 10 to 300 amino acids, more preferably from about 30 to 100 nucleotide 
residues. However, it is well recognized that searches for commercially important fragments, 
such as sequence fragments involved in gene expression and protein processing, may be of shorter 
length. 

As used herein, "a target structural motif," or "target motif," refers to any rationally 
20 selected sequence or combination of sequences in which the sequence(s) are chosen based on a 
three-dimensional configuration which is formed upon the folding of the target motif. There are a 
variety of target motifs known in the art. Protein target motifs include, but are not limited to, 
enzyme active sites and signal sequences. Nucleic acid target motifs include, but are not limited 
to, promoter sequences, hairpin structures and inducible expression elements (protein binding 
25 sequences). 



4.15 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be used to 
control gene expression through triple helix formation or antisense DNA or RNA, both of which 
30 methods are based on the binding of a polynucleotide sequence to DNA or RNA, Polynucleotides 
suitable for use in these methods are preferably 20 to 40 bases in length and are designed to be 
complementary to a region of the gene involved in transcription (triple helix - see Lee et al, Nucl. 
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Acids Res. 6:3073 (1979); Cooney et al, Science 15241:456 (1988); and Dervan et al., Science 
251:1360 (1991)) or to the mRNA itself (antisense - Olmno, J. Neurochem. 56:560 (1991); 
Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL 
(1988)). Triple helix-formation optimally results in a shut-off of RNA transcription from DNA, 
5 while antisense RNA hybridization blocks translation of an mRNA molecule into polypeptide. 
Both techniques have been demonstrated to be effective in model systems. Information contained 
in the sequences of the present invention is necessary for the design of an antisense or triple helix 
oligonucleotide. 



10 4,16 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression of 
one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic 
acid probe or antibodies of the present invention, optionally conjugated or otherwise associated 
with a suitable label. 

1 5 In general, methods for detecting a polynucleotide of the invention can comprise 

contacting a sample with a compound that binds to and forms a complex with the polynucleotide 
for a period sufficient to form the complex, and detecting the complex, so that if a complex is 
detected, a polynucleotide of the invention is detected in the sample. Such methods can also 
comprise contacting a sample under stringent hybridization conditions with nucleic acid primers 

20 that anneal to a polynucleotide of the invention under such conditions, and amplifying annealed 
polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is 
detected in the sample. 

In general, methods for detecting a polypeptide of the invention can comprise contacting a 
sample with a compound that binds to and forms a complex with the polypeptide for a period 

25 sufficient to form the complex, and detecting the complex, so that if a complex is detected, a 
polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of the 
antibodies or one or more of the nucleic acid probes of the present invention and assaying for 
binding of the nucleic acid probes or antibodies to components within the test sample. 

30 Conditions for incubating a nucleic acid probe or antibody with a test sample vary. 

Incubation conditions depend on the format employed in the assay, the detection methods 
employed, and the type and nature of the nucleic acid probe or antibody used in the assay. One 
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skilled in the art will recognize that any one of the commonly available hybridization, 
amplification or immunological assay formats can readily be adapted to employ the nucleic acid 
probes or antibodies of the present invention. Examples of such assays can be found in Chard, T., 
An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, 
5 Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in Immunocytochemistry, 
Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice 
and Theory of immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, 
Elsevier Science Publishers, Amsterdam, The Netherlands (1985). The test samples of the present 
invention include cells, protein or membrane extracts of cells, or biological fluids such as sputum, 

10 blood, serum, plasma, or urine. The test sample used in the above-described method will vary 
based on the assay format, nature of the detection method and the tissues, cells or extracts used as 
the sample to be assayed. Methods for preparing protein extracts or membrane extracts of cells 
are well known in the art and can be readily be adapted in order to obtain a sample which is 
compatible with the system utilized. 

1 5 In another embodiment of the present invention, kits are provided which contain the 

necessary reagents to carry out the assays of the present invention. Specifically, the invention 
provides a compartment kit to receive, in close confinement, one or more containers which 
comprises: (a) a first container comprising one of the probes or antibodies of the present 
invention; and (b) one or more other containers comprising one or more of the following: wash 

20 reagents, reagents capable of detecting presence of a bound probe or antibody. 

In detail, a compartment kit includes any kit in which reagents are contained in separate 
containers. Such containers include small glass containers, plastic containers or strips of plastic 
or paper. Such containers allows one to efficiently transfer reagents from one compartment to 
another compartment such that the samples and reagents are not cross-contaminated, and the 

25 agents or solutions of each container can be added in a quantitative fashion from one compartment 
to another. Such containers will include a container which will accept the test sample, a container 
which contains the antibodies used in the assay, containers which contain wash reagents (such as 
phosphate buffered saline, Tris-buffers, etc.), and containers which contain the reagents used to 
detect the bound antibody or probe. Types of detection reagents include labeled nucleic acid 

30 probes, labeled secondary antibodies, or in the alternative, if the primary antibody is labeled, the 
enzymatic, or antibody binding reagents which are capable of reacting with the labeled antibody. 
One skilled in the art will readily recognize that the disclosed probes and antibodies of the present 
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invention can be readily incorporated into one of the established kit formats which are well known 

in the art. 

4.17 MEDICAL IMAGING 

5 The novel polypeptides and binding partners of the invention are useful in medical 

imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the 
invention is involved in the immune response, for imaging sites of inflammation or infection). 
See, e.g., Kunkel et al, U.S. Pat. NO. 5,413,778. Such methods involve chemical attachment of a 
labeling or imaging agent, administration of the labeled polypeptide to a subject in a 
10 pharmaceutical^ acceptable carrier, and imaging the labeled polypeptide in vivo at the target site. 

4.1 8 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present invention 
further provides methods of obtaining and identifying agents which bind to a polypeptide encoded 
15 by an ORF corresponding to any of the nucleotide sequences set forth in SEQ ID NOs: 1 - 1 1 , or 
bind to a specific domain of the polypeptide encoded by the nucleic acid. In detail, said method 
comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by an ORF of the present 
invention, or nucleic acid of the invention; and 
20 (b) determining whether the agent binds to said protein or said nucleic acid. 

In general, therefore, such methods for identifying compounds that bind to a 
polynucleotide of the invention can comprise contacting a compound with a polynucleotide of the 
invention for a time sufficient to form a polynucleotide/compound complex, and detecting the 
complex, so that if a polynucleotide/compound complex is detected, a compound that binds to a 
25 polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind to a 
polypeptide of the invention can comprise contacting a compound with a polypeptide of the 
invention for a time sufficient to form a polypeptide/compound complex, and detecting the 
complex, so that if a polypeptide/compound complex is detected, a compound that binds to a 
30 polynucleotide of the invention is identified. 

Methods for identifying compounds that bind to a polypeptide of the invention can also 
comprise contacting a compound with a polypeptide of the invention in a cell for a time sufficient 
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to form a polypeptide/compound complex, wherein the complex drives expression of a receptor 

gene sequence in the cell, and detecting the complex by detecting reporter gene sequence 

expression, so that if a polypeptide/compound complex is detected, a compound that binds a 

polypeptide of the invention is identified. 

5 Compounds identified via such methods can include compounds which modulate the 

activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to 

activity observed in the absence of the compound). Alternatively, compounds identified via such 

methods can include compounds which modulate the expression of a polynucleotide of the 

invention (that is, increase or decrease expression relative to expression levels observed in the 

10 absence of the compound). Compounds, such as compounds identified via the methods of the 
invention, can be tested using standard assays well known to those of skill in the art for their 
ability to modulate activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be selected 

15 and screened at random or rationally selected or designed using protein modeling techniques. 

For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and 
the like are selected at random and are assayed for their ability to bind to the protein encoded by 
the ORF of the present invention. Alternatively, agents may be rationally selected or designed. 
As used herein, an agent is said to be "rationally selected or designed" when the agent is chosen 

20 based on the configuration of the particular protein. For example, one skilled in the art can 

readily adapt currently available procedures to generate peptides, pharmaceutical agents and the 
like, capable of binding to a specific peptide sequence, in order to generate rationally designed 
antipeptide peptides, for example see Hurby et al., Application of Synthetic Peptides: Antisense 
Peptides," In Synthetic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and 

25 Kaspczak et al., Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like. 

In addition to the foregoing, one class of agents of the present invention, as broadly 
described, can be used to control gene expression through binding to one of the ORFs or EMFs of 
the present invention. As described above, such agents can be randomly screened or rationally 
designed/selected. Targeting the ORF or EMF allows a skilled artisan to design sequence specific 

30 or element specific agents, modulating the expression of either a single ORF or multiple ORFs 
which rely on the same EMF for expression control. One class of DNA binding agents are agents 
which contain base residues which hybridize or form a triple helix formation by binding to DNA 
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or RNA. Such agents can be based on the classic phosphodiester, ribonucleic acid backbone, or 
can be a variety of sulfhydryl or polymeric derivative' which have base attachment capacity. 

Agents suitable for use in these methods preferably contain 20 to 40 bases and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - see 
5 Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al, Science 241 :456 (1988); and Dervan et 
al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560 
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 
Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription 
from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into 
10 polypeptide. Both techniques have been demonstrated to be effective in model systems. 

Information contained in the sequences of the present invention is necessary for the design of an 
antisense or triple helix oligonucleotide and other DNA binding agents. 

Agents which bind to a protein encoded by one of the ORFs of the present invention can 
be used as a diagnostic agent. Agents which bind to a protein encoded by one of the ORFs of the 
1 5 present invention can be formulated using known techniques to generate a pharmaceutical 
composition. 

4.19 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid 
20 hybridization probes capable of hybridizing with naturally occurring nucleotide sequences. The 
hybridization probes of the subject invention may be derived from any of the nucleotide 
sequences SEQ ID NOs: 1-11. Because the corresponding gene is only expressed in a limited 
number of tissues, a hybridization probe derived from any of the nucleotide sequences SEQ ID 
NOs: 1 - 1 1 can be used as an indicator of the presence of RNA of cell type of such a tissue in a 
25 sample. 

Any suitable hybridization technique can be employed, such as, for example, in situ 
hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 provides additional 
uses for oligonucleotides based upon the nucleotide sequences. Such probes used in PCR may be 
of recombinant origin, may be chemically synthesized, or a mixture of both. The probe will 
30 comprise a discrete nucleotide sequence for the detection of identical sequences or a degenerate 
pool of possible sequences for identification of closely related genomic sequences. 
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Other means for producing specific hybridization probes for nucleic acids include the 
cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such vectors 
are known in the art and are commercially available and may be used to synthesize RNA probes 
in vitro by means of the addition of the appropriate RNA polymerase as T7 or SP6 RNA 
5 polymerase and the appropriate radioactively labeled nucleotides. The nucleotide sequences may 
be used to construct hybridization probes for mapping their respective genomic sequences. The 
nucleotide sequence provided herein may be mapped to a chromosome or specific regions of a 
chromosome using well known genetic and/or chromosomal mapping techniques. These 
techniques include in situ hybridization, linkage analysis against known chromosomal markers, 

10 hybridization screening with libraries or flow-sorted chromosomal preparations specific to known 
chromosomes, and the like. The technique of fluorescent in situ hybridization of chromosome 
spreads has been described, among other places, in Verma et al (1988) Human Chromosomes: A 
Manual of Basic Techniques, Pergamon Press, New York NY. 

Fluorescent in situ hybridization of chromosomal preparations and other physical 

1 5 chromosome mapping techniques may be correlated with additional genetic map data. Examples 
of genetic map data can be found in the 1994 Genome Issue of Science (265: 198 If). Correlation 
between the location of a nucleic acid on a physical chromosomal map and a specific disease (or 
predisposition to a specific disease) may help delimit the region of DNA associated with that 
genetic disease. The nucleotide sequences of the subject invention may be used to detect 

20 differences in gene sequences between normal, carrier or affected individuals. 

4.20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared by, for example, 
directly synthesizing the oligonucleotide by chemical means, as is commonly practiced using an 
automated oligonucleotide synthesizer. 

25 Support bound oligonucleotides may be prepared by any of the methods known to those of 

skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy is to 
precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can be 
achieved using passive adsorption (Inouye & Hondo, (1990) J. Clin. Microbiol. 28(6) 1469-72); 
using UV light (Nagata et al, 1985; Dahlen et al f 1987; Monissey & Collins, (1989) Mol. Cell 

30 Probes 3(2) 189-207) or by covalent binding of base modified DNA (Keller et al t 1988; 1989); all 
references being specifically incorporated herein. 
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Another strategy that may be employed is the use of the strong biotin-streptavidin interaction 

as a linker. For example, Broude et al (1994) Proc. Natl. Acad. Sci. USA 91(8) 3072-6, describe the 

use of biotinylated probes, although these are duplex probes, that are immobilized on 

streptavidin-coated magnetic beads. Streptavidin-coated beads may be purchased from Dynal, Oslo. 

5 Of course, this same linking chemistry is applicable to coating any surface with streptavidin. 

Biotinylated probes may be purchased from various sources, such as, e.g., Operon Technologies 

(Alameda, CA). 

Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used. Nunc 
Laboratories have developed a method by which DNA can be covalently bound to the microwell 

1 0 surface termed Covalink NH. CovaLink NH is a polystyrene surface grafted with secondary amino 
groups (>NH) that serve as bridge-heads for further covalent coupling. CovaLink Modules may be 
purchased from Nunc Laboratories. DNA molecules may be bound to CovaLink exclusively at the 
5'-end by a phosphoramidate bond, allowing immobilization of more than 1 pmol of DNA 
(Rasmussen etal, (1991) Anal. Biochem. 198(1) 138-42). 

1 5 The use of CovaLink NH strips for covalent binding of DNA molecules at the 5-end has 

been described (Rasmussen et al., (1991). In this technology, a phosphoramidate bond is employed 
(Chu et al., (1983) Nucleic Acids Res. 1 1(8) 6513-29). This is beneficial as immobilization using 
only a single covalent bond is preferred. The phosphoramidate bond joins the DNA to the CovaLink 
NH secondary amino groups that are positioned at the end of spacer arms covalently grafted onto the 

20 polystyrene surface through a 2 nm long spacer arm. To link an oligonucleotide to CovaLink NH via 
a phosphoramidate bond, the oligonucleotide terminus must have a 5'-end phosphate group. It is, 
perhaps, even possible for biotin to be covalently bound to CovaLink and then streptavidin used to 
bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) and 
25 denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1 M 1-methylimidazole, pH 
7.0 (l-Melmv), is then added to a final concentration of 10 mM l-Melm?. A ss DNA solution is then 
dispensed into CovaLink NH strips (75 ul/well) standing on ice. 

Carbodiimide 0.2 M l-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), dissolved in 
10 mM 1-Melm 7 , is made fresh and 25 ul added per well. The strips are incubated for 5 hours at 
30 50°C. After incubation the strips are washed using, e.g., Nunc-Immuno Wash; first the wells are 

washed 3 times, then they are soaked with washing solution for 5 min., and finally they are washed 3 
times (where in the washing solution is 0.4 N NaOH, 0.25% SDS heated to 50°C). 
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It is contemplated that a further suitable method for use with the present invention is that 
described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated herein by 
reference. This method of preparing an oligonucleotide bound to a support involves attaching a 
nucleoside 3-reagent through the phosphate group by a covalent phosphodiester link to aliphatic 
5 hydroxyl groups carried by the support. The oligonucleotide is then synthesized on the supported 
nucleoside and protecting groups removed from the synthetic oligonucleotide chain under standard 
conditions that do not cleave the oligonucleotide from the support. Suitable reagents include 
nucleoside phosphoramidite and nucleoside hydrogen phosphorate. 

An on-chip strategy for the preparation of DNA probe for the preparation of DNA probe 

1 0 arrays may be employed. For example, addressable laser-activated photodeprotection may be 

employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described by 
Fodor et al (1991) Science 251(4995) 767-73, incorporated herein by reference. Probes may also be 
immobilized on nylon supports as described by Van Ness et al (1991) Nucleic Acids Res. 19(12) 
3345-50; or linked to Teflon using the method of Duncan & Cavalier (1988) Anal. Biochem. 169(1) 

1 5 104-8; all references being specifically incorporated herein. 

To link an oligonucleotide to a nylon support, as described by Van Ness et al (1991), 
requires activation of the nylon surface via alkylation and selective activation of the 5'-amine of 
oligonucleotides with cyanuric chloride. 

One particular way to prepare support bound oligonucleotides is to utilize the light-generated 

20 synthesis described by Pease et al, (1994) PNAS USA 91(11) 5022-6, incorporated herein by 
reference). These authors used current photolithographic techniques to generate arrays of 
immobilized oligonucleotide probes (DNA chips). These methods, in which light is used to direct 
the synthesis of oligonucleotide probes in high-density, miniaturized arrays, utilize photolabile 
5-protected N-acyl-deoxynucleoside phosphoramidites, surface linker chemistry and versatile 

25 combinatorial synthesis strategies. A matrix of 256 spatially defined oligonucleotide probes may be 
generated in this manner. 

4.21 PREPARATION OF NUCLEIC ACID FRAGMENTS 

The nucleic acids maybe obtained from any appropriate source, such as cDNAs, genomic 
DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC inserts, and RNA, 
30 including mRNA without any amplification steps. For example, Sambrook et al. (1 989) describes 
three protocols for the isolation of high molecular weight DNA from mammalian cells (p. 9.14-9.23). 
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DNA fragments may be prepared as clones in M13, plasmid or lambda vectors and/or 
prepared directly from genomic DNA or cDNA by PCR or other amplification methods. Samples 
may be prepared or dispensed in multiwell plates. About 1 00-1 000 ng of DNA samples may be 
prepared in 2-500 ml of final volume. 
5 The nucleic acids would then be fragmented by any of the methods known to those of skill in 

the art including, for example, using restriction enzymes as described at 9.24-9.28 of Sambrook et al 
(1989), shearing by ultrasound and NaOH treatment. 

Low pressure shearing is also appropriate, as described by Schriefer et al (1990) Nucleic 
Acids Res. 18(24) 7455-6, incorporated herein by reference). In this method, DNA samples are 
1 0 passed through a small French pressure cell at a variety of low to intermediate pressures. A lever 
device allows controlled application of low to intermediate pressures to the cell. The results of these 
studies indicate that low-pressure shearing is a useful alternative to sonic and enzymatic DNA 
fragmentation methods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using the two 
1 5 base recognition endonuclease, Cv/JI, described by Fitzgerald et al (1 992) Nucleic Acids Res. 

20(14) 3753-62. These authors described an approach for the rapid fragmentation and fractionation 
of DNA into particular sizes that they contemplated to be suitable for shotgun cloning and 
sequencing. 

The restriction endonuclease Cv/JI normally cleaves the recognition sequence PuGCPy 
20 between the G and C to leave blunt ends. Atypical reaction conditions, which alter the specificity of 
this enzyme (Cv/JI**), yield a quasi-random distribution of DNA fragments form the small molecule 
pUC19 (2688 base pairs). Fitzgerald et al (1992) quantitatively evaluated the randomness of this 
fragmentation strategy, using a Cv/JI** digest of pUC19 that was size fractionated by a rapid gel 
filtration method and directly ligated, without end repair, to a lac Z minus Ml 3 cloning vector. 
25 Sequence analysis of 76 clones showed that Cv/JI** restricts pyGCPy and PuGCPu, in addition to 
PuGCPy sites, and that new sequence data is accumulated at a rate consistent with random 
fragmentation. 

As reported in the literature, advantages of this approach compared to sonication and agarose 
gel fractionation include: smaller amounts of DNA are required (0.2-0.5 ug instead of 2-5 ug); and 
30 fewer steps are involved (no preligation, end repair, chemical extraction, or agarose gel 
electrophoresis and elution are needed 
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Irrespective of the manner in which the nucleic acid fragments are obtained or prepared, it is 
important to denature the DNA to give single stranded pieces available for hybridization. This is 
achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is then cooled 
quickly to 2°C to prevent renaturation of the DNA fragments before they are contacted with the chip. 
5 Phosphate groups must also be removed from genomic DNA by methods known in the art. 

4-22 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane. 
Spotting may be performed by using arrays of metal pins (the positions of which correspond to an 
array of wells in a microliter plate) to repeated by transfer of about 20 nl of a DNA solution to a 

1 0 nylon membrane. By offset printing, a density of dots higher than the density of the wells is 

achieved. One to 25 dots may be accommodated in 1 mm 2 , depending on the type of label used. By 
avoiding spotting in some preselected number of rows and columns, separate subsets (subarrays) 
may be formed. Samples in one subarray may be the same genomic segment of DNA (or the same 
gene) from different individuals, or may be different, overlapped genomic clones. Each of the 

1 5 subarrays may represent replica spotting of the same samples. In one example, a selected gene 

segment may be amplified from 64 patients. For each patient, the amplified gene segment may be in 
one 96-well plate (all 96 wells containing the same sample). A plate for each of the 64 patients is 
prepared. By using a 96-pin device, all samples maybe spotted on one 8 x 12 cm membrane. 
Subarrays may contain 64 samples, one from each patient. Where the 96 subarrays are identical, the 

20 dot span may be 1 mm 2 and there may be a 1 mm space between subarrays. 

Another approach is to use membranes or plates (available from NUNC, Naperville, Illinois) 
which may be partitioned by physical spacers e.g. a plastic grid molded over the membrane, the grid 
being similar to the sort of membrane applied to the bottom of multiwell plates, or hydrophobic 
strips. A fixed physical spacer is not preferred for imaging by exposure to flat phosphor-storage 

25 screens or x-ray films. 

The present invention is illustrated in the following examples. Upon consideration of the 
present disclosure, one of skill in the art will appreciate that many other embodiments and variations 
may be made in the scope of the present invention. Accordingly, it is intended that the broader 
aspects of the present invention not be limited to the disclosure of the following examples. The 

30 present invention is not to be limited in scope by the exemplified embodiments which are intended as 
illustrations of single aspects of the invention, and compositions and methods which are functionally 
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equivalent are within the scope of the invention. Indeed, numerous modifications and variations in 

the practice of the invention are expected to occur to those skilled in the art upon consideration of the 

present preferred embodiments. Consequently, the only limitations which should be placed upon the 

scope of the invention are those which appear in the appended claims. 

5 All references cited within the body of the instant specification are hereby incorporated by 

reference in their entirety. 

5. EXAMPLES 

5.1 EXAMPLE 1 

Novel Nucleic Acid Sequences Obtained From Various Libraries 

1 0 A plurality of novel nucleic acids were obtained from cDNA libraries prepared from various 

human tissues and in some cases isolated from a genomic library derived from human chromosome 
using standard PCR, SBH sequence signature analysis and Sanger sequencing techniques. The 
inserts of the library were amplified with PCR using primers specific for the vector sequences which 
flank the inserts. Clones from cDNA libraries were spotted on nylon membrane filters and screened 

1 5 with oligonucleotide probes (e.g., 7-mers) to obtain signature sequences. The clones were clustered 
into groups of similar or identical sequences. Representative clones were selected for sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a typical 
Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye terminator 
cycle sequencing. Single pass gel sequencing was done using a 377 Applied Biosystems (ABI) 

20 sequencer to obtain the novel nucleic acid sequences. In some cases RACE (Random Amplification 
of cDNA Ends) was performed to further extend the sequence in the 5' direction. 

5.2 EXAMPLE 2 
Novel Nucleic Acids 

The novel nucleic acids of the present invention of the invention were assembled from 
25 sequences that were obtained from a cDNA library by methods described in Example 1 above, and in 
some cases sequences obtained from one or more public databases. The nucleic acids were 
assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend the seed 
EST into an extended assemblage, by pulling additional sequences from different databases (i.e., 
Hyseq's database containing EST sequences, dbEST, gb pri and UniGene) that belong to this 
30 assemblage. The algorithm terminated when there were no additional sequences from the above 
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databases that would extend the assemblage. Inclusion of component sequences into the assemblage 
was based on a BLASTN hit to the extending assemblage with BLAST score greater than 300 and 
percent identity greater than 95%. 

Using PHRAP (Univ. of Washington) or CAP4 (Paracel), a full length gene cDNA sequence 
5 and its corresponding protein sequence were generated from the assemblage. Any frame shifts and 
incorrect stop codons were corrected by hand editing. During editing, the sequence was checked 
using FASTY and/or BLAST against Genbank (i.e., dbEST, gb pri, UniGene, Genpept) and the 
amino acid version of Genseq. Other computer programs which may have been used in the editing 
process were phredPhrap and Consed (University of Washington) and ed-ready, ed-ext and cg-zip-2 
10 (Hyseq, Inc.). The full-length nucleotide and amino acid sequences, including splice variants 
resulting from these procedures are shown in the Sequence Listing as SEQ ID NOS: 1-11. 
Table 1 shows the various tissue sources of SEQ ID NO: 1-11. 

The nearest neighbor results for polypeptides encoded by SEQ ID NO: 1-1 1 were obtained 
by a BLASTP (BLAST 1.2.3-Paracell (2001-1 1-20)) search against Genpept, Genseq and 

1 5 SwissProt databases using BLAST algorithm. The nearest neighbor result showed the closest 
homologue with functional annotation for SEQ ID NO: 1-11. The translated amino acid 
sequences for which the nucleic acid sequence encodes are shown in the Sequence Listing. The 
homologues with identifiable functions for SEQ ID NO: 1-1 1 are shown in Table 2 below. 

Using eMatrix software package (Stanford University, Stanford, CA) (Wu et aL, J. Comp. 

20 Biol. Vol. 6 pp. 219-235 (1999) herein incorporated by reference), polypeptides encoded by SEQ 
ID NO: 1-11 were examined to determine whether they had identifiable signature regions. Table 
3 shows the signature region found in the indicated polypeptide sequences, the description of the 
signature, the eMatrix p-value(s) and the position(s) of the signature within the polypeptide 
sequence. 

25 Using the Pfam software program (Sonnhammer et al., Nucleic Acids Res., Vol. 26(1) pp. 

320-322 (1998) herein incorporated by reference) polypeptides encoded by SEQ ID NO: 1-11 
were examined for domains with homology to certain peptide domains. Table 4 shows the name 
of the domain found, the description, the product of all the e- value of similar domains found, the 
Pfam score for the identified domain within the sequence, number of similar domains found, and 

30 the position of the domain in the SEQ ED NO: being interrogated. 

The GeneAtlas™ software package (Molecular Simulations Inc. (MSI), San Diego, CA) 
was used to predict the three-dimensional structure models for the polypeptides encoded by SEQ 
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ID NO: 1-11. Models were generated by (1) PSI-BLAST which is a multiple alignment sequence 
profile-based searching developed by Altschul et al., (Nucl. Acids Res. 25, 3389-3408 (1997)), (2) 
High Throughput Modeling (HTM) (Molecular Simulations Inc. (MSI) San Diego, CA) which is 
an automated sequence and structure searching procedure (http://www.msi.com/) , and (3) 
5 SeqFold™ which is a fold recognition method described by Fischer and Eisenberg (J. Mol. Biol. 
209, 779-791 (1998)). This analysis was carried out, in part, by comparing the polypeptides of 
the invention with the known NMR (nuclear magnetic resonance) and x-ray crystal three- 
dimensional structures as templates. Table 5 shows, "PDB ID", the Protein DataBase (PDB) 
identifier given to template structure; "Chain ID", identifier of the subcomponent of the PDB 

10 template structure; "Compound Information", information of the PDB template structure and/or 
its subcomponents; "PDB Function Annotation" gives function of the PDB template as annotated 
by the PDB files rtittp://www.rcsb.org/PDBA ; start and end amino acid position of the protein 
sequence aligned; PSI-BLAST score, the verify score, the SeqFold score, and the Potential(s) of 
Mean Force (PMF). The verify score produced by GeneAtlas™ software (MSI), is based on Dr. 

1 5 Eisenberg's Profile-3D threading program developed in Dr. David Eisenberg's laboratory (U.S. 
Patent No. 5,436,850 and Luthy, Bowie, and Eisenberg, Nature, 356:83-85 (1992)) and a 
publication by R. Sanchez and A. Sali, Proc. Natl. Acad. Sci. USA, 95:12502-13597. The verify 
score produced by GeneAtlas™ normalizes the verify score for proteins with different lengths so 
that a unified cutoff can be used to select good models as follows: 

20 

Verify score (normalized) = (raw score - 1/2 high score)/(l/2 high score) 

The PMF score, produced by GeneAtlas™ software (MSI), is a composite scoring 
function that depends in part on the compactness of the model, sequence identity in the alignment 

25 used to build the model, pairwise and surface mean force potential (MFP). As given in Table 5, a 
verify score between 0 to 1.0, with 1 being the best, represents a good model. Similarly, a PMF 
score between 0 to 1.0 with 1 being the best, represents a good model. A SeqFold™ score of 
more than 50 is considered significant. A good model may also be determined by one of skill in 
the art based on all the information in Table 5 taken in totality. 

30 The nucleotide sequence within the sequences that codes for signal peptide sequences and 

their cleavage sites can be determined from using Neural Network SignalP VI. 1 program (from 
Center for Biological Sequence Analysis, The Technical University of Denmark). The process for 
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identifying prokaryotic and eukaryotic signal peptides and their cleavage sites are also disclosed 
by Henrik Nielson, Jacob Engelbrecht, Soren Brunak, and Gunnar von Heijne in the publication 
"Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage 
sites" Protein Engineering, Vol 10, no. 1, pp. 1-16 (1997), incorporated herein by reference. A 
maximum S score and a mean S score, as described in the Nielson et al., as reference, were 
obtained for the polypeptide sequences. Table 6 shows the position of the last amino acid of the 
signal peptide in each of the polypeptides and the maximum score and mean score associated with 
that signal peptide. 

Table 7 correlates each of SEQ ID NO: 1-1 1 to a specific chromosomal location. 

Table 8 is a correlation table of the novel polynucleotide sequences SEQ ID NO: 1-11, 
novel polypeptide sequences SEQ ID NO: 1-11, and their corresponding priority nucleotide 
sequences in the priority application USSN 09/815,925, herein incorporated by reference in its 
entirety. 
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TABLE 1 



PCT/US02/08964 





Librarv/RNA 
Source 


HYSEQ Library Name 


SEQ ID NOS: 


adult brain 


BioChain 


ABR012 


3 


adult brain 


Clontech 


ABR001 


3 


adult brain 


Clontech 


ABR008 


3 610 


adult brain 


GIBCO 


ABD003 


3 


adult heart 


GIBCO 


AHR001 


35 


adult kidney 


GIBCO 


AKD001 


379 


adult kidney 


Invitrogen 


AKT002 


69 


adult lunff 


GIBCO 


ALG001 


69 


adult snleen 


Clontech 


SPLcOl 


9 


bladder 


Invitrogen 


BLD001 


1 5 


bone marrow 


Clontech 


BMD001 


9 


bone marrow 


GF 


BMD002 


1 3 


cervix 


BioChain 


CVX001 


79 


colon 


Invitrogen 


CLN001 


29 


endothelial cells 


Strategene 


EDT001 


3 6-7 9-10 


fetal brain 


Clontech 


FBR006 


3 10-11 


fetal brain 


GIBCO 


HFB001 


3 


fetal heart 


Invitrogen 


FHR001 


9 


fetal kidney 


Clontech 


FKD002 


7 


fetal liver-spleen 


Soares 


FLS001 


359 


fetal liver-spleen 


Soares 


FLS002 


3-6 9 


fetal lung 


Invitrogen 


FLG003 


1-2 6-7 


fetal muscle 


Invitrogen 


FMS001 


3811 


fetal muscle 


Invitrogen 


FMS002 


6 


fetal skin 


Invitrogen 


FSK001 


16 


infant brain 


Soares 


IB2002 


139 


leukocytes 


GIBCO 


LUC001 


9 


lymphocyte 


ATCC 


LPC001 


9 


mix 


B/I/C 


SUP008 


9 


mix 


B/I/C 


SUP009 


3 


mixed 




CGdOll 


4 


mixed 




CGd012 


2468 11 


neuron 


Strategene 


NTD001 


9 


ovary 


Invitrogen 


AOV001 


379 


prostate 


Clontech 


PRT001 


7 


rectum 


Invitrogen 


REC001 


3 


salivary gland 


Clontech 


SAL001 


2 


skeletal muscle 


Clonetech 


SKMS03 


8 


skeletal muscle 


Clontech 


SKM001 


38 11 


small intestine 


Clontech 


SIN001 


8 


thymus 


Clontech 


THMc02 


1 


trachea 


Clontech 


TRC001 


3 


umbilical cord 


BioChain 


FUC001 


36 


uterus 


Clontech 


UTR001 


1 


young liver 


GIBCO 


ALV001 


9 
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TABLE 2 



PCT/US02/08964 



SEO ID 


Hit ID 


Species 


Description 


S score 


Percentage 
identity 


12 


gil 1275980 


Homo sapiens 


NOTCH 1 (Nl)mRNA, 
complete cds. 


14525 


99 


12 


ei338675 


Homo sapiens 


Human TAN-1 mRNA 
(homologue of Drosophila 
Notch gene), 5' end. 


13918 


99 


12 


gi3 123675 


Rattus rattus 


rat notch protein 


13122 


89 


13 


gi2370133 


Homo sapiens 


partial MUC5B gene, exon 30- 
48. 


3553 


98 


13 


gi2290534 


Homo sapiens 


Human sublingual gland mucin 
(MUC5B) mRNA, partial cds. 


3494 


99 


13 


pil2843546 


Mus musculus 


putative 


2265 


62 


14 


AAB47140 


Homo sapiens 


INCY- CDIFF-21, Incyte ID 
No. 1360522CD1. 


959 


100 


14 


AAB37424 


Homo sapiens 


ROSE/ Human secreted protein 
BLAST search protein SEQ ID 
NO: 134. 


959 


100 


14 


AAB36111 


Homo sapiens 


CURA- Human MTC50. 


959 


100 


15 


gi4377339 


Chlamydophila 

pneumoniae 

CWL029 


Methionine Aminopeptidase 


80 


34 


15 


gi8979382 


Chlamydophila 
pneumoniae J 138 


methionine aminopeptidase 


80 


34 


15 


gi9800317 


rat 

cytomegalovirus 
Maastricht 


pR102 


78 


29 


16 


gil5081392 


Homo sapiens 


NAC1 protein mRNA, complete 
cds. 


2742 


100 


16 


gi2384732 


Rattus 
norvegicus 


NAC-1 protein 


2328 


87 


16 


gil2849997 


Mus musculus 


putative 


2325 


87 


17 


AAH46863_aal 


Homo sapiens 


MILL- Human 23553 sulfatase 
^polypeptide encoding cDNA. 


3137 


100 


17 


AAB85483 


Homo sapiens 


MILL- Human 23553 sulfatase 
polypeptide. 


3137 


100 


17 


gil 5394725 


Homo sapiens 


unnamed protein product 


3137 


100 


18 


gil 283 8404 


Mus musculus 


putative 


1581 


90 


18 


AAU18309 


Homo sapiens 


HUMA- Human endocrine 
polypeptide SEQ ID No 264. 


1507 


99 


18 


AAU19635 


Homo sapiens 


HUMA- Human novel 
extracellular matrix protein, Seq 
ID No 285. 


679 


97 


19 


gi!2844321 


Mus musculus 


putative 


2979 


92 


19 


eil6552019 

A 


Homo sapiens 


cDNA FLJ32015 fis, clone 
NTONG1000052, weakly 
similar to Rattus norvegicus 
mRNA for Kelch related protein 
1. 


2284 


99 


19 


gil 63068 13 


Homo sapiens 


clone MGC: 1367 
IMAGE:2959774, mRNA, 
complete cds. 


1715 


50 


20 


AAB95294 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 175 19. 


2432 


100 
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SEQID 


Hit ID 


Species 


Description 


S score 


Percentage 

luciiuiy 


20 


gil0434931 


Homo sapiens 


cDNA FLJ13081 fis, clone 
NT2RP3002033. 


2432 


100 


20 


AAB85773 


Homo sapiens 


INC Y- Human drug 
metabolizing enzyme (ID No. 
19323521D1). 




00 
yy 


21 


gil82925 


Homo sapiens 


Human gamma amino butyric 
acid (GABAA) receptor beta-3 
subunit mRNA, complete cds. 


2348 


99 


21 


gil4714965 


Homo sapiens 


ganima-aminobutyric acid 
jvjAoaj a receptor, oeia 
clone MGC:9051 
IMAGE:3871 111, mRNA, 
complete cos. 


2341 


99 


21 


■nr r i eft 

gi755159 


"T7 i 

Mus musculus 


vjAxSA-oenzoaiazepme receptor 
beta-3 subunit 




07 

y * 




A AMOS 1 18 


Homo ^aniens 


HUMA- Human reoroductive 
system related antigen SEQ ID 
NO: 3776. 


650 


84 


22 


gil2838772 


Mus musculus 


putative 


478 


55 


22 


ABB 12186 


Homo sapiens 


HYSE- Human AQ homologue, 
SEQIDNO:2556. 


352 


64 
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TABLE 3 



PCT/US02/08964 



SEQ ID 

NO: 


Database 
entry ID 


Description 


Results* 


12 


PD00078 


REPEAT PROTEIN ANK NUCLEAR 
ANKYR. 


PD00078B 13.14 8.500e-ll 1959-1972 


12 


PR00764 


COMPLEMENT C9 SIGNATURE 


PR00764F 16.89 7.164e-10 1066-1087 
PR00764F 16.89 7.164e-10 1190-1211 


12 


PF00023 


Ank repeat proteins. 


PF00023B 14.20 6.000e-10 1962-1972 
PF0OO23A 16.03 7.375e-10 2066-2082 


12 


PR00021 


SMALL PROLINE-RICH PROTEIN 
SIGNATURE 


PR00021A4.31 1.342e-09 2404-2417 


12 


BL00612 


Osteonectin domain proteins. 


BL00612B 11.35 U79e-09 1310-1343 


12 


RT 01 IKS 


P-terminal cv^tine knot nroteins 


BLOl 185B 21.14 2.636e-ll 1140-1189 
BL01185B 21.14 8.948e-ll 747-796 
BL01185B 21.14 5.667e-10 940-989 
BLOl 185B 21.14 3.118e-09 291-340 


12 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 3.475e-09 761-789 


12 


PR00680 


P-TYPE TREFOIL DOMAIN 
SIGNATURE 


PR00680B 10.34 7.682e-09 1506-1519 


12 


PR00009 


TYPE I EGF SIGNATURE 


PR00009C 14.11 8.059e-09 659-671 


12 


PF00791 


Domain present in ZO-1 and Unc5-like 
netrin receptors. 


PF00791B 28.49 9.214e-10 2033-2088 
PF00791B 28.49 8.514e-09 2000-2055 


12 


BLOl 187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01187B 12.04 5.154e-16 1245-1261 
BL01187B 12.04 8.650e-14 429-445 
BL01187B 12.04 9.100e-14 885-901 
BL01187B 12.04 2.565e-13 999-1015 
BL01187B 12.04 5.304e-13 312-328 
BL01187B 12.04 2.667e-12 1161-1177 
BL01187B 12.04 5.667e-12 693-709 
BL01187B 12.04 7.333e-12 730-746 
BL01187B 12.04 7.667e- 12 655-671 
BL01187B 12.04 9.000e-12 961-977 
BL01187B 12.04 1.600e-ll 467-483 
BL01187B 12.04 4.300e-ll 1037-1053 
BL01187B 12,04 4.900e-ll 543-559 
BLOl 187A 9.98 2.714e-10 944-956 
BL01187B 12.04 2.800e-10 580-596 
BL01I87B 12.04 3.314e-10 505-521 
BL01187A 9.98 4.857e-10 1182-1194 
BL01187B 12.04 6.143e-10 350-366 
BL01187B 12.04 7.943e-10 272-288 
BL01187B 12.04 8.457e-10 1123-1139 
BL01187A 9.98 8.714e-10 450-462 
BL01187B 12.04 9.486e-10 155-171 
BL01187A 9.98 2.500e-09 1266-1278 
BL01187B 12.04 3.700e-09 806-822 
BL01187B 12.04 3.925e-09 1199-1215 
BLOl 187A 9.98 4.000e-09 868-880 
BL01187B 12.04 6.400e-09 195-211 
BL01187A 9.98 8.500e-09 488-500 
BL01187A 9.98 8.500e-09 564-576 
BL01187B 12.04 8.875e-09 1075-1091 
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♦Results include in order: accession number subtype; raw score; p-value; position of signature 



amino acid sequence 



SEQID 


Database 


Description 


Results* 


NO: 


entry ID 






12 


DM00060 


338 kw NEUREXIN ALPHA III 
CYSTEINE. 


DM00060 6.92 7.750e-12 684-694 
DM00060 6.92 1.750e-l 1 914-924 
DM00060 6.92 7.250e-l 1 759-769 
DM00060 6.92 9.750e- 11 146-156 
DM00060 6.92 6.850e-10 108-118 
DM00060 6.92 4.780e-09 224-234 
DM00060 6.92 4.960e-09 303-313 
DM00060 6.92 9.100e-09 1314-1324 


12 


PD00919 


CALCIUM-BINDING PRECURSOR 
SIGNAL R. 


PD00919A 11.53 1. 8 18e-09 339-351 
PD00919A 11.53 5.227e-09 1150-1162 
PD00919A 11.53 6.045e-09 144-156 
PD00919A 11.53 6.318e-09 532-544 
PD00919A 1 1.53 8.500e-09 912-924 
PD00919A 1 1.53 9.182e-09 950-962 


12 


PR00011 


TYPE m EGF-LIKE SIGNATURE 


PROOOl ID 14.03 9.852e-ll 1287-1306 
PROOOllB 13.08 5.174e-10 236-255 
PROOOllB 13.08 3.466e-09 1287-1306 
PROOOl ID 14.03 7.261e-09 545-564 
PROOOl ID 14.03 7.391e-09 236-255 
PROOOl 1 A 14.06 8.151e-09 236-255 
PROOOl ID 14.03 8.174e-09 582-601 
PROOOllB 13.08 9.260e-09 1125-1144 


12 


BL00279 


Membrane attack complex components 
/ perforin proteins. 


BL00279E37.il 5.385e-ll 1049-1097 
BL00279E37.il 7.923e-ll 1173-1221 
BL00279E37.il 3.711e-10 129-177 
BL00279E37.il 5.012e- 10 742-790 
BL00279E37.il 8.048e- 10 441-489 
BL00279E 37.11 2.047e-09 1011-1059 
BL00279E 37.11 9.267e-09 897-945 
BL00279E37.il 9.58U-09 1338-1386 


12 


PR00010 


TYPE n EGF-LIKE SIGNATURE 


PROOOIOC 11.16 7.882e-13 1250-1261 
PROOOIOC 11.16 7.750e-12 1004-1015 
PROOOlOA 11.79 1.692e-ll 791-803 
PROOOlOA 11.79 3.077e-ll 452-464 
PROOOIOC 11.16 3.333e-ll 966-977 
PROOOlOA 11.79 2.452e-10 528-540 
PROOOlOA 1 1.79 3.323e-10 678-690 
PROOOIOC 11.16 3.455e-10 698-709 
PROOOlOA 11.79 3.613e-10 946-958 
PROOOlOA 11.79 7.677e-10 1184-1196 
PROOOlOA 11.79 9.419e-10 490-502 
PROOOIOC 11.16 1.214e-09 1331-1342 
PROOOlOB 10.30 2.059e-09 690-698 
PROOOIOC 11.16 2.929e-09 434-445 
PROOOlOB 10.30 3.647e-09 996-1004 
PROOOIOC 11.16 4.214e-09 160-171 
PROOOIOC 11.16 4.429e-09 890-901 
PROOOIOC 11.16 5.500e-09 83-94 
PROOOIOC 11.16 6.143e-09 585-596 
PROOOIOC 11.16 7.214e-09 1166-1177 
PROOOIOC 11.16 7.429e-09 510-521 
PROOOIOC 1 1.16 8.500e-09 355-366 
PROOOIOC 11.16 9.571e-09 548-559 



114 



WO 02/077180 PCT/US02/08964 



SEQ ED 
NO: 


Database 
entry ID 


Description 


Results* 








PR00010C 11.16 9.786e-09 660-671 


12 


BL00022 


EGF-like domain proteins. 


BL00022B 7.54 3.250e-lU 1254-1261 
BL00022B 7.54 4.0UUe-Uy 1UUO-1U15 
BL00022A 7.48 9.000e-09 687-694 

T5T AAA77D 7 Q 1 flfio flO 1*57 174 

BL00022B 7.54 9.100e-09 398-405 


13 


BL01185 


C-terminal cystine knot proteins. 


BL01 185D 23.45 8.043e-19 580-633 


13 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


nnAnoiP 17 <A < nn<» 1 1 7Q1_illO 


13 


PR00802 


SERUM ALBUMIN FAMILY 
SIGNATURE 


PR00802C 12.28 6. 885e- 10 470-488 


16 


PF00651 


BTB (also known as BR-C/Ttk) 
domain proteins. 


PF00651 15.00 l.OOOe- 10 43-56 


17 


BL00523 


Sulfatases proteins. 


n t nno'i'C in T7 O iic a 1/1 077 'iAT 
BL0U523b \y.Zl o.lz5e-l** 2/ /-3U/ 

BL00523C 12.64 4.000e-13 145-156 
BL00523A 13.36 7.300e-13 53-70 
BL00523B 8.64 6.1 14e-l 1 99-1 1 1 

"DT f\AC7ir> O CO 7 174#» HO 77^ 74.7 


18 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 1.000e-ll 181-197 


18 


PF00791 


Domain present m ZO-1 and Unc5-like 
netrin receptors. 


DCAA7Q11} *>C ylO 7 771*» 1 1 7 AO "XC\A 

rrUU/yio Zo.4y z.Z/je-ll ZHVou*t 
PF00791C 20.98 9.165e-10 263-302 
PF00791B 28.49 9.650e-10 181-236 
PF00791B 28.49 3.312e-09 148-203 


19 


PF00651 


BTB (also known as BR-C/Ttk) 
domain proteins. 


rrUlKOl 15.UU Z.ZoOe-lU *fO-jy 


21 


PR00253 


GAMMA-AMENOBUTYRIC ACID 
(GABA) RECEPTOR SIGNATURE 


PR00253A 9.15 5.714e-24 246-267 
PR00253C 13.85 9.500e-23 306-328 
PR00253B 13.47 3.14ie-zz Z/z-Zy4 
PR00253D 16.68 7.000e-22 451-472 


21 


PR00252 


NEUROTRANSMITTER-GATED 
ION CHANNEL FAMILY 


PR00252C 17.49 3.739e-15 161-176 
PR00252A 14.28 4.115e-13 83-100 
PR00252D 12 29 7 750e-ll 237-250 
PR00252B 15.17 3.250e-10 115-127 


21 


BL00236 


Neurotransmitter-gated ion-channels 
proteins. 


BL00236D 25.66 1.857e-32 230-272 
BL00236C 25.16 2.227e-31 146-185 
BL00236A 21.96 4.682e-24 63-101 
BL00236B 14.67 9.625e-10 116-126 



TABLE 4 



SEQ ID 


Model 


Description 


E-value 


Score 


Repeats 


Position 


12 


EGF 


EGF-like domain 


2.3e-271 


914.9 


37 


24-57:63-98:106- 
138:144-175:182- 
215:222-254:261- 
292:299-332:339- 
370:376-409:416- 
449:456-487:494- 
525:532-563:570- 
600:607-638:645- 
675:682-713:720- 
750:757-788:795- 
826:833-867:874- 
905:912-943:950- 
981:988-1019:1026- 
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SEQ n> 


Model 


Description 


E-value 


Score 


Repeats 


Position 














1057:1064- 
1095:1111- 
1143:1150- 
1181:1188- 
lZiy.lZZo- 
1265:1272- 
1305:1312- 
1346:1353- 
1384:1392- 

14ZO.OZ;J-1DOZ | 


12 


ank 


Ankyrin repeat 


5e-47 


169.7 


6 


1881-1926:1928- 

iyoU:lyol- 

1994:1995- 

ZUZ 1 .ZUZo- 

2060:2061-2093 


12 


notch 


[Notch (UoLj aomain 


7 /lo 7Q 

z.*»eoy 


1AA 9 


i 


14*0-l*#ol . l*»oO- 

1523:1524-1563 


12 


disinte 
grin 


Disintegrin 


2.4 


-25.5 


1 


811-863 


12 


Keratin 

Til 


Keratin, high sulfur B2 
protein 


7.5 


-82.0 


1 


103-258 


12 


Metall 

othio 

PEC 


Plant PEC family 
metallothionein 


8.3 


-40.6 


1 


1244-1307 


13 


vwc 


von WilleDrana iactor 
type C domain 


4./e-uo 




i 


481:486-548 


1 1 
13 


Cyskn 

Ol 


Cystine-knot domain 


j.*ie-UD 


9A 7 
ZO./ 


i 
i 


JHI-OJJ 


13 


vwd 


von Willebrand factor 


0.0009 


18.2 


1 


8-133 


13 


TIL 


Trypsin Inhibitor like 
cysteine rich domain 


1.1 


-4.2 


1 


254-309 


10 


DTD 

r> Id 


DTD/DH7 <4nmiin 

u 1 D/rUL aomain 


i.yeou 


I l*f .0 


i 
i 


90-194 


17 


Sulfata 
se 


Sulfatase 


2.9e-94 


326.6 


1 


53-479 


18 


ank 


Ankyrin repeat 


1.6e-48 


174.6 


5 


143-175:176- 
208:209-241:243- 

97£«977 "*flQ 
Z /O.Z / /-juy 


1 C 


rnJ 


.—. ,. — 

Fibronectin type III 
domain 




Q 7 


i 
i 


O 07 


19 


Kelch 


Kelch motif 


1.7e-36 


134.7 


5 


348-399:401- 

d40 -451 -407 *4QQ 
•fry .*t j i / .*ty y- 

544:546-600 


19 


BTB 


BTB/POZ domain 


8.4e-35 


129.0 


1 


23-128 


91 
Li 


Ncui_c 
han L 
BD 


M ai i r extra n c m i tte* r- <* a t& A 
Vi CUiUUolloilli UC1 -galCU 

ion-channel lig 


1 . vC-O I 


9R4 ^ 


1 


37-243 


21 


Neur_c 
han_m 
emb 


Neurotransmitter-gated 
ion-channel tra 


3.4e-67 


236.7 


1 


250-468 
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Table 5 



SE 

Q 

ID 

NO 


PD 
B 
ID 


CHA 
IN 
ID 


STA 
RT 
AA 


EN 
D 

AA 


Psi 
Bla 

St 


Verif 

y 

score 


PMF 
score 


SEQFO 
LD 
score 


Coumpoun 
d 


PDB annotation 


12 


la3 
d 




1069 


121 
9 


l.9e 
-22 


0.01 


-0.09 




PHOSPHO 
LIPASE 

A2; 

CHAIN: 
NULL; 


CARBOXYLIC ESTER 
HYDROLASE 
PHOSPHOLIPASE, 
TR1MER, CALCIUM 
BINDING, 

ACTIVATOR SITE, 2 
CARBOXYLIC ESTER 
HYDROLASE 


12 


laut 


L 


1057 


115 
8 


7.6e 
-21 


0.34 


0.13 




ACTIVAT 
ED 

PROTEIN 
C; CHAIN: 
C, L; D- 
PHE-PRO- 
MAI; 

CHAIN: P; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) 

AUTOPROTHROMBIN 
HA; HYDROLASE, 
SERINE 
PROTEINASE), 
PLASMA CALCIUM 
BINDING, 2 
GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHI 
BITOR) 


12 


laut 


L 


126 


230 


7.6e 
-25 


0.37 


0.76 




ACTIVAT 
ED 

PROTEIN 
C; CHAIN: 
C, L; D- 
PHE-PRO- 
MAI; 

CHAIN: P; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) 

AUTOPROTHROMBIN 
IIA; HYDROLASE, 
SERINE 
PROTEINASE), 
PLASMA CALCIUM 
BINDING, 2 
GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHI 
BITOR) 


12 


laut 


L 


1294 


139 
8 


5.7e 
-16 


0.21 


-0.03 




ACTIVAT 
ED 

PROTEIN 
C; CHAIN: 
C. L: D- • 
PHE-PRO- 
MAI; 

CHAIN: P; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) 

AUTOPROTHROMBIN 
IIA; HYDROLASE, 
SERINE 
PROTEINASE), 
PLASMA CALCIUM 
BINDING, 2 
GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHI 
BITOR) 


12 


laut 


L 


738 


846 


5.7e 
-23 


0.27 


0.16 




ACTIVAT 
ED 

PROTEIN 
C; CHAIN: 
C, L; D- 
PHE-PRO- 
MAI; 

CHAIN: P; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) 

AUTOPROTHROMBIN 
IIA; HYDROLASE, 
SERINE 
PROTEINASE), 
PLASMA CALCIUM 
BINDING, 2 
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GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHI 
BITOR) 


12 


laut 


L 


826 


920 


1.5e 
-22 


0.31 


0.84 




ACTIVAT 
ED 

PROTEIN 
C; CHAIN: 
C, L; D- 
PHE-PRO- 
MAI; 

CHAIN: P; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) 

AUTOPROTHROMBIN 
IIA; HYDROLASE, 
SERINE 
PROTEINASE), 
PLASMA CALCIUM 
BINDING, 2 
GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHI 
BITOR) 


12 


law 
c 


B 


1929 


208 


1.9e 
-41 


0.32 


1.00 




GA 

BINDING 
PROTEIN 
ALPHA; 
CHAIN: A; 
GA 

BINDING 
PROTEIN 
BETA 1; 
CHAIN: B; 
DNA; 
CHAIN: D, 
E; 


COMPLEX 

(TRANSCRIPTION 

REGULATION/DNA) 

GABPALPHA; 

GABPBETA1; 

COMPLEX 

(TRANSCRIPTION 

REGULATION/DNA), 

DNA-BINDING, 2 

NUCLEAR PROTEIN, 

ETS DOMAIN, 

ANKYRIN REPEATS, 

TRANSCRIPTION 3 

FACTOR 


12 


lbd 
8 




1874 


201 
9 


L9e 
-26 


0.46 


1.00 




P19INK4D 
CDK4/6 
INHIBITO 
R; CHAIN: 
NULL; 


TUMOR SUPPRESSOR 
TUMOR 

SUPPRESSOR, CDK4/6 
INHIBITOR, ANKYRIN 
MOTIF 


12 


lbd 
8 




1929 


208 
4 


1.9e 
-40 


0.16 


1.00 




P19INK4D 
CDK4/6 
INHIBITO 
R; CHAIN: 
NULL; 


TUMOR SUPPRESSOR 
TUMOR 

SUPPRESSOR, CDK4/6 
INHIBITOR, ANKYRIN 
MOTIF 


12 


tbi 
7 


B 


1995 


211 
4 


lie 
•28 


0.34 


1.00 




CYCLIN- 

DEPENDE 

NT 

KINASE 6; 
CHAIN: A; 
MULTIPL 
E TUMOR 
SUPPRESS 
OR; 

CHAIN: B; 


COMPLEX 
(KINASE/ANTI- 
ONCOGENE) CDK6; 
P16INK4A, MTS1; 
CYCLIN DEPENDENT 
KINASE, CYCLIN 
DEPENDENT KINASE 
INHIBITORY 2 
PROTEIN, CDK, INK4, 
CELL CYCLE, 
MULTIPLE TUMOR 
SUPPRESSORS MTS1, 
COMPLEX 
(KINASE/ANTI- 
ONCOGENE) HEADER 


12 


Ibl 

X 


B 


1874 


205 
4 


7.6e 
-27 


0.13 


0.99 




CYCLIN- 
DEPENDE 


COMPLEX 
(INHIBITOR 
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NT 

KINASE 6; 
CHAIN: A; 
P19INK4D 
; CHAIN: 
B; 


PROTEIN/KINASE) 
INHIBITOR PROTEIN, 
CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 
2 CONTROL, 
ALPHA/BETA, 
COMPLEX 
(INHIBITOR 
PROTEIN/KINASE) 


12 


lbl 

X 


B 


1931 


208 
7 


1.9e 
-39 


-0.01 


1.00 




CYCLIN- 

DEPENDE 

NT 

KINASE 6; 
CHAIN: A; 
P19INK4D 
; CHAIN: 
B; 


COMPLEX 

(INHIBITOR 

PROTEIN/KINASE) 

INHIBITOR PROTEIN, 

CYCLIN-DEPENDENT 

KINASE, CELL CYCLE 

2 CONTROL, 

ALPHA/BETA, 

COMPLEX 

(INHIBITOR 

PROTEIN/KINASE) 


12 


lbl 

X 


B 


1962 


211 
7 


9.5e 
-36 


0.44 


1.00 




CYCLIN- 

DEPENDE 

NT 

KINASE 6; 
CHAIN: A; 
P19INK4D 
; CHAIN: 
B; 


COMPLEX 

(INHIBITOR 

PROTEIN/KINASE) 

INHIBITOR PROTEIN, 

CYCLIN-DEPENDENT 

KINASE, CELL CYCLE 

2 CONTROL, 

ALPHA/BETA, 

COMPLEX 

(INHIBITOR 

PROTEIN/KINASE) 


12 


Icej 


A 


1312 


138 
4 


9.5e 
-15 


0.39 


0.15 




MEROZOl 
TE 

SURFACE 
PROTEIN 
1; CHAIN: 
A; 


SURFACE PROTEIN 
MEROZOITE 
SURFACE ANTIGEN 1, 
MAJOR BLOOD- 
STAGE EGF-LIKE 
DOMAIN, 

EXTRACELLULAR, 
MODULAR PROTEIN, 
SURFACE 2 ANTIGEN, 
MALARIA VACCINE 
COMPONENT, 
SURFACE PROTEIN 


12 


Icej 


A 


791 


908 


3.8e 
-18 


-0.05 


0.06 




MEROZOl 
TE 

SURFACE 
PROTEIN 
1 ; CHAIN: 
A; 


SURFACE PROTEIN 
MEROZOITE 
SURFACE ANTIGEN 1, 
MAJOR BLOOD- 
STAGE EGF-LIKE 
DOMAIN, 

EXTRACELLULAR, 
MODULAR PROTEIN, 
SURFACE 2 ANTIGEN, 
MALARIA VACCINE 
COMPONENT, 
SURFACE PROTEIN 


12 


Icej 


A 


946 


102 
8 


l.9e 
-19 


0.02 


-0.18 




MEROZOl 
TE 

SURFACE 


SURFACE PROTEIN 
MEROZOITE 
SURFACE ANTIGEN 1, 
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PROTEIN 
1; CHAIN: 
A; 


MAJOR BLOOD- 
STAGE EGF-LIKE 
DOMAIN, 

EXTRACELLULAR, 
MODULAR PROTEIN, 
SURFACE 2 ANTIGEN, 
MALARIA VACCINE 
COMPONENT, 
SURFACE PROTEIN 


1 L 


MO 
lay 

S 


A 
r\ 


1 700 


21 1 

u\ 1 

4 


5.7e 
-34 


0.07 


0.95 




CYCLIN- 

DEPENDE 

NT 

KINASE 4 
INHIBITO 
RB; 

CHAIN: A; 


SIGNALING PROTEIN 
HELIX -TURN-HELIX, 
ANKYRIN REPEAT 


12 


Ida 
n 


L 


1024 


116 
3 


7.6e 
-23 


0.14 


-0.05 




BLOOD 

COAGUL 

ATION 

FACTOR 

VIIA; 

CHAIN: L, 

H; 

SOLUBLE 
TISSUE 
FACTOR; 
CHAIN: T, 
U; D-PHE- 
PHE-ARG- 
CHLORO, 
METHYL 
KETONE 
(DFFRCM 
K) WITH 
CHAIN: C; 


BLOOD 

COAGULATION, 
SERINE PROTEASE, 
COMPLEX, CO- 
FACTOR, 2 
RECEPTOR ENZYME, 
INHIBITOR, GLA, 
EGF, 3 COMPLEX 
(SERINE 

PROTEASE/COFACTO 
R/LIGAND) 


12 


Ida 
n 


L 


1075 


120 
1 


7.6e 
-26 


0.35 


-0.02 




BLOOD 

COAGUL 

ATION 

FACTOR 

VIIA; 

CHAIN: L, 

H; 

SOLUBLE 
TISSUE 
FACTOR; 
CHAIN: T, 
U; D-PHE- 
PHE-ARG- 
CHLORO 
METHYL 
KETONE 
(DFFRCM 
K) WITH 
CHAIN: C; 


BLOOD 

COAGULATION, 
SERINE PROTEASE, 
COMPLEX, CO- 
FACTOR, 2 
RECEPTOR ENZYME, 
INHIBITOR, GLA, 
EGF, 3 COMPLEX 
(SERINE 

PROTEASE/COFACTO 
R/LIGAND) 


12 


Ida 
n 


L 


1123 


123 
7 


9.5e 
-26 


0.32 


0.24 




BLOOD 
COAGUL 
ATION 
FACTOR 


BLOOD 

COAGULATION, 
SERINE PROTEASE, 
COMPLEX, CO- 
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o 


B 


IN 


RT 


D 


Bla 


y 


score 


LD 


d 




ID 


ID 


ID 


AA 


AA 


St 


score 




score 






NO 








































VIIA; 


FACTOR, 2 




















CHAIN: L, 


RECEPTOR ENZYME, 




















H; 


INHIBITOR, GLA, 




















SOLUBLE 


EGF, 3 COMPLEX 




















TISSUE 


(SERINE 




















FACTOR; 


PROTEASE/COFACTO 




















CHAIN: T, 


R/LIGAND) 




















U; D-PHE- 






















PHE-ARG- 






















CHLORO 






















METHYL 






















KETONE 






















(DFFRCM 






















K) WITH 






















CHAIN: C; 




12 


Ida 
n 


L 


1161 


128 
5 


7.6e 
-29 


0.06 


-0.12 




BLOOD 

COAGUL 

ATION 

FACTOR 

VIIA; 

CHAIN: L, 

H; 

SOLUBLE 
TISSUE 
FACTOR; 
CHAIN: T, 
U; D-PHE- 
PHE-ARG- 
CHLORO 
METHYL 
KETONE 
(DFFRCM 
K) WITH 
CHAIN: C; 


BLOOD 

COAGULATION, 
SERINE PROTEASE, 
COMPLEX, CO- 
FACTOR, 2 
RECEPTOR ENZYME, 
INHIBITOR, GLA, 
EGF, 3 COMPLEX 
(SERINE 

PROTEASE/COFACTO 
R/LIGAND) 


12 


Ida 
n 


L 


125 


227 


1.5e 
-26 


-0.15 


0.43 




BLOOD 

COAGUL 

ATION 

FACTOR 

VIIA; 

CHAIN: L, 

H; 

SOLUBLE 
TISSUE 
FACTOR; 
CHAIN: T, 
U; D-PHE- 
PHE-ARG- 
CHLORO 
METHYL 
KETONE 
(DFFRCM 
K) WITH 
CHAIN: C; 


BLOOD 

COAGULATION, 
SERINE PROTEASE, 
COMPLEX, CO- 
FACTOR, 2 
RECEPTOR ENZYME, 
INHIBITOR, GLA, 
EGF, 3 COMPLEX 
(SERINE 

PROTEASE/COFACTO 
R/LIGAND) 


12 


Ida 
n 


L 


150 


274 


3.8e 
-25 


0.09 


-0.01 




BLOOD 

COAGUL 

ATION 

FACTOR 

VIIA; 


BLOOD 

COAGULATION, 
SERINE PROTEASE, 
COMPLEX, CO- 
FACTOR, 2 
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CHAIN: L, 
H; 

SOLUBLE 
TISSUE 
FACTOR; 
CHAIN: T, 
U; D-PHE- 
PHE-ARG- 
CHLORO 
METHYL 
KETONE 
(DFFRCM 
K) WITH 
CHAIN: C; 


RECEPTOR ENZYME, 
INHIBITOR, GLA, 
EGF, 3 COMPLEX 
(SERINE 

PROTEASE/COFACTO 
R/LIGAND) 


12 


Ida 
n 


L 


312 


423 


9.5e 
-26 


0.22 


-0.06 




BLOOD 

COAGUL 

ATION 

FACTOR 

VIIA; 

CHAIN: L, 

H; 

SOLUBLE 

TISSUE 

FACTOR; 

CHAIN: T, 

U; D-PHE- 

PHE-ARG- 

CHLORO 

METHYL 

KETONE 

(DFFRCM 

K)WITH 

CHAIN: C; 


BLOOD 

COAGULATION, 
SERINE PROTEASE, 
COMPLEX, CO- 
FACTOR, 2 
RECEPTOR ENZYME, 
INHIBITOR, GLA, 
EGF, 3 COMPLEX 
(SERINE 

PROTEASE/COFACTO 
R/LIGAND) 


12 


Ida 
n 


L 


31 


149 


lie 

-22 


0.20 


0.23 




BLOOD 

COAGUL 

ATION 

FACTOR 

VIIA; 

CHAIN: L, 

H; 

SOLUBLE 
TISSUE 
FACTOR; 
CHAIN: T, 
U; D-PHE- 
PHE-ARG- 
CHLORO 
METHYL 
KETONE 
(DFFRCM 
K) WITH 
CHAIN: C; 


BLOOD 

COAGULATION, 
SERINE PROTEASE, 
COMPLEX, CO- 
FACTOR, 2 
RECEPTOR ENZYME, 
INHIBITOR, GLA, 
EGF, 3 COMPLEX 
(SERINE 

PROTEASE/COFACTO 
R/LIGAND) 


12 


Ida 
n 


L 


505 


620 


3.8e 
-31 


-0.32 


0.00 




BLOOD 

COAGUL 

ATION 

FACTOR 

VIIA; 

CHAIN: L, 


BLOOD 

COAGULATION, 
SERINE PROTEASE, 
COMPLEX, CO- 
FACTOR, 2 
RECEPTOR ENZYME, 
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o 


B 


IN 


RT 


D 


Bla 


y 


score 


LD 


d 




ID 


ID 


ID 


AA 


AA 


St 


score 




score 






NO 








































H; 


INHIBITOR, GLA, 




















SOLUBLE 


EGF, 3 COMPLEX 




















TISSUE 


(SERINE 




















FACTOR; 


PROTEASE/COFACTO 




















CHAIN: T, 


R/LIGAND) 




















U; D-PHE- 






















PHE-ARG- 






















CHLORO 






















METHYL 






















KETONE 






















(DFFRCM 






















K) WITH 






















CHAIN: C; 




12 


ldc 

q 


A 


1922 


208 
8 


1.9c 
-29 


-0.26 


0.45 




PYK2- 

ASSOCIA 

TED 

PROTEIN 
BETA; 
CHAIN: A; 


METAL BINDING 
PROTEIN ZrNC- 
BINDING MODULE, 
ANKYRIN REPEATS, 
METAL BINDING 
PROTEIN 


12 


Idv 
a 


L 


1019 


110 
3 


1.9e 
-20 


0.25 


0.31 




DES-GLA 
FACTOR 
VIIA 
(HEAVY 
CHAIN); 
CHAIN: H, 
I; DES- 
GLA 
FACTOR 
VIIA 
(LIGHT 
CHAIN); 
CHAIN: L, 
M; (DPN)- 
PHE-ARG; 
CHAIN: C, 
D; 

PEPTIDE 

E-76; 

CHAIN: X, 
Y; 


HYDROLASE/HYDRO 
LASE INHIBITOR 
PROTEIN-PEPTIDE 
COMPLEX 


12 


ldv 
a 


L 


1305 


139 
8 


5.7e 
-18 


0.48 


0.09 




DES-GLA 
FACTOR 
VIIA 
(HEAVY 
CHAJN); 
CHAIN: H, 
I; DES- 
GLA 
FACTOR 
VIIA 
(LIGHT 
CHAIN); 
CHAIN: L, 
M; (DPN)- 
PHE-ARG; 
CHAIN: C, 
D; 

PEPTIDE 


HYDROLASE/HYDRO 
LASE INHIBITOR 
PROTEIN-PEPTIDE 
COMPLEX 
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E-76; 

CHAIN: X, 
Y; 




12 


ldv 
a 


L 


1391 


151 

6 


1.9e 
•08 


-0.00 


-0.11 




DES-GLA 
FACTOR 
VIIA 
(HEAVY 
CHAIN); 
CHAIN: H, 
I; DES- 
GLA 
FACTOR 
VIIA 
(LIGHT 
CHAIN); 
CHAIN: L, 
M; (DPN)- 
PHE-ARG; 
CHAIN: C, 
D; 

PEPTIDE 
E-76; 

CHAIN: X, 
Y; 


HYDROLASE/HYDRO 
LASE INHIBITOR 
PROTEIN-PEPTIDE 
COMPLEX 


12 


ldv 
a 


L 


449 


545 


7.6e 
-26 


-0.16 


0.19 




DES-GLA 
FACTOR 
VIIA 
(HEAVY 
CHAIN); 
CHAIN: H, 
I; DES- 
GLA 
FACTOR 
VIIA 
(LIGHT 
CHAIN); 
CHAIN: L, 
M; (DPN)- 
PHE-ARG; 
CHAIN: C, 
D; 

PEPTIDE 
E-76; 

CHAIN: X, 
Y; 


HYDROLASE/HYDRO 
LASE INHIBITOR 
PROTEIN-PEPTIDE 
COMPLEX 


12 


ldv 
a 


L 


675 


762 


1.7e 
-26 


0.31 


0.53 




DES-GLA 
FACTOR 
VIIA 
(HEAVY 
CHAIN); 
CHAIN: H, 
1; DES- 
GLA 
FACTOR 
VIIA 
(LIGHT 
CHAIN); 
CHAIN: L, 


HYDROLASE/HYDRO 
LASE INHIBITOR 
PROTEIN-PEPTIDE 
COMPLEX 
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M; (DPN)- 
PHE-ARG; 
CHAIN: C, 
D; 

PEPTIDE 
E-76; 

CHAIN: X, 
Y; 




12 


ldv 
a 


L 


905 


100 
1 


7.6e 
-28 


-0.00 


0.31 




DES-GLA 
FACTOR 
VIIA 
(HEAVY 
CHAIN); 
CHAIN: H, 
I; DES- 
GLA 
FACTOR 
VIIA 
(LIGHT 
CHAIN); 
CHAIN: L, 
M; (DPN)- 
PHE-ARG; 
CHAIN: C, 
D; 

PEPTIDE 
E-76; 

CHAIN: X, 
Y; 


HYDROLASE/HYDRO 
LASE INHIBITOR 
PROTEIN-PEPTIDE 
COMPLEX 


12 


ldv 
a 


L 


943 


103 
1 


3.8e 
-27 


0.28 


0.12 




DES-GLA 
FACTOR 
VIIA 
(HEAVY 
CHAIN); 
CHAIN: H, 
I; DES- 
GLA 
FACTOR 
VIIA 
(LIGHT 
CHAIN); 
CHAIN: L, 
M; (DPN> 
PHE-ARG; 
CHAIN: C, 
D; 

PEPTIDE 
E-76; 

CHAIN: X, 
Y; 


HYDROLASE/HYDRO 
LASE INHIBITOR 
PROTEIN-PEPTIDE 
COMPLEX 


12 


ldv 
a 


L 


981 


106 
9 


l.le 
-23 


0.05 


0.34 




DES-GLA 
FACTOR 
VIIA 
(HEAVY 
CHAIN); 
CHAIN: H, 
I; DES- 
GLA 


HYDROLASE/HYDRO 
LASE INHIBITOR 
PROTEIN-PEPTIDE 
COMPLEX 
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o 


B 


IN 


RT 


D 


Bla 


y 


score 


LD 


d 




ID 


CD 


ID 


AA 


AA 


St 


score 




score 






NO 








































FACTOR 






















VIIA 






















(LIGHT 






















CHAIN); 






















CHAIN: L, 






















M- (DPN)- 






















PHE-ARG; 






















CHAIN: C, 






















D; 






















PEPTIDE 






















E-76; 






















CHAIN: X, 






















Y; 




12 


Idx 
5 


I 


1064 


118 
1 


1.9e 
-21 


0.36 


-0.08 




THROMBI 
N LIGHT 
CHAIN; 
CHAIN: A, 
B, C, D; 
THROMBI 
N HEAVY 
CHAIN; 
CHAIN: 
M, N,0,P; 
THROMB 
OMODULI 
N; CHAIN: 

I. J. K,L; 

THROMBI 

N 

INHIBITO 
R L-GLU- 
L-GLY-L- 
ARM; 
CHAIN: E, 
F, G, H; 


SERINE PROTEINASE 
COAGULATION 
FACTOR II; 
COAGULATION 
FACTOR II; 
FETOMODULIN, TM, 
CD141 ANTIGEN; 
EGR-CMK SERINE 
PROTEINASE, EGF- 
LIKE DOMAINS, 
ANTICOAGULANT 
COMPLEX, 2 
ANTIFIBRINOLYTIC 
COMPLEX 


12 


Idx 
5 


I 


1309 


142 
7 


1.3e 
-17 


0.63 


-0.12 




THROMBI 
N LIGHT 
CHAIN; 
CHAIN: A, 
B, C, D; 
THROMBI 
N HEAVY 
CHAIN; 
CHAIN: 
M,N, 0,P; 
THROMB 
OMODULI 
N; CHAIN: 
I, J, K, L; 
THROMBI 
N 

INHIBITO 
R L-GLU- 
L-GLY-L- 
ARM; 
CHAIN: E, 
F,G, H; 


SERINE PROTEINASE 
COAGULATION 
FACTOR n; 
COAGULATION 
FACTOR II; 
FETOMODULIN, TM, 
CD141 ANTIGEN; 
EGR-CMK SERINE 
PROTEINASE, EGF- 
LIKE DOMAINS, 
ANTICOAGULANT 
COMPLEX, 2 
ANTIFIBRINOLYTIC 
COMPLEX 


12 


Idx 


I 


221 


332 


3.8e 


0.19 


0.03 




THROMBI 


SERINE PROTEINASE 
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5 








-29 








N LIGHT 

CHAIN; 

CHAIN: A, 

B,C,D; 

THROMBI 

N HEAVY 

CHAIN; 

CHAIN: 

M, N, O, P; 

THROMB 

OMODULI 

N; CHAIN: 

U,K,L; 

THROMBI 

N 

INHIBITO 
R L-GLU- 
L-GLY-L- 
ARM; 
CHAIN: E, 
F, G, H; 


COAGULATION 
FACTOR II; 
COAGULATION 
FACTOR II; 
FETOMODULIN, TM, 
CDI41 ANTIGEN; 
EGR-CMK SERINE 
PROTEINASE, EGF- 
LIKE DOMAINS, 
ANTICOAGULANT 
COMPLEX, 2 
ANTIFIBRINOLYTIC 
COMPLEX 


12 


ldx 
5 


I 


411 


532 


5.7e 
-28 


-0.17 


0.40 




THROMBI 

N LIGHT 

CHAIN; 

CHAIN: A, 

B,C,D; 

THROMBI 

N HEAVY 

CHAIN; 

CHAIN: 

M, N, O, P; 

THROMB 

OMODULI 

N; CHAIN: 

U,K,L; 

THROMBI 

N 

INHIBITO 
R L-GLU- 
L-GLY-L- 
ARM; 
CHAIN: E, 
F r G ? H; 


SERINE PROTEINASE 
COAGULATION 
FACTOR II; 
COAGULATION 
FACTOR II; j 
FETOMODULIN, TM, 
CD141 ANTIGEN; 
EGR-CMK SERINE 
PROTEINASE, EGF- 
LIKE DOMAINS, 
ANTICOAGULANT 
COMPLEX, 2 
ANTIFIBRINOLYTIC 
COMPLEX 


12 


ldx 
5 


I 


492 


600 


5.7e 
-28 


0.18 


0.43 




THROMBI 
N LIGHT 
CHAIN; 
CHAIN: A, 
B,C, D; 
THROMBI 
N HEAVY 
CHAIN; 
CHAIN: 
M, N, O, P; 
THROMB 
OMODULI 
N; CHAIN: 
UK, L; 
THROMBI 


SERINE PROTEINASE 
COAGULATION 
FACTOR II; 
COAGULATION 
FACTOR II; 
FETOMODULIN, TM, 
CD141 ANTIGEN; 
EGR-CMK SERINE 
PROTEINASE, EGF- 
LIKE DOMAINS, 
ANTICOAGULANT 
COMPLEX, 2 
ANTIFIBRINOLYTIC 
COMPLEX 
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N 

INHIBITO 
R L-GLU- 
L-GLY-L- 
ARM; 
CHAIN: E, 
F, G, H; 




12 


ldx 
5 


I 


58 


177 


1.5e 
-25 


0.17 


-0.13 




THROMBI 

N LIGHT 

CHAIN; 

CHAIN: A, 

B,C,D; 

THROMBI 

N HEAVY 

CHAIN; 

CHAIN: 

M, N, O, P; 

THROMB 

OMODULI 

N; CHAIN: 

I,J,K,L; 

THROMBI 

N 

INHIBITO 
R L-GLU- 
L-GLY-L- 
ARM; 
CHAIN: E, 
F, G, H; 


SERINE PROTEINASE 
COAGULATION 
FACTOR II; 
COAGULATION 
FACTOR II; 
FETOMODULIN, TM, 
CD141 ANTIGEN; 
EGR-CMK SERINE 
PROTEINASE, EGF- 
LIKE DOMAINS, 
ANTICOAGULANT 
COMPLEX, 2 
ANTIFIBRINOLYTIC 
COMPLEX 


12 


ldx 

5 


I 


641 


750 


3.8e 
-26 


0.07 


0.49 




THROMBI 

N LIGHT 

CHAIN; 

CHAIN: A, 

B,C,D; 

THROMBI 

N HEAVY 

CHAIN; 

CHAIN: 

M,N,0,P; 

THROMB 

OMODULI 

N; CHAIN: 

U. K,L; 

THROMBI 

N 

INHIBITO 
RL-GLU- 
L-GLY-L- 
ARM; 
CHAIN: E, 
F, G, H; 


SERINE PROTEINASE 
COAGULATION 
FACTOR II; 
COAGULATION 
FACTOR II; 
FETOMODULIN, TM, 
CD141 ANTIGEN; 
EGR-CMK SERINE 
PROTEINASE, EGF- 
LIKE DOMAINS, 
ANTICOAGULANT 
COMPLEX, 2 
ANTIFIBRINOLYTIC 
COMPLEX 


12 


ldx 
5 


I 


717 


826 


3.8e 
-26 


0.19 


0.13 




THROMBI 
N LIGHT 
CHAIN; 
CHAIN: A, 
B, C, D; 
THROMBI 
N HEAVY 


SERINE PROTEINASE 
COAGULATION 
FACTOR II; 
COAGULATION 
FACTOR II; 
FETOMODULIN, TM, 
CD141 ANTIGEN; 
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CHAIN; 
CHAIN: 
M, N, O, P; 
THROMB 
OMODULI 
N; CHAIN: 
I, J, K,L; 
THROMBI 
N 

INHIBITO 
RL-GLU- 
L-GLY-L- 
ARM; 
CHAIN: E, 
F, G, H; 


EGR-CMK SERINE 
PROTEINASE, EGF- 
LIKE DOMAINS, 
ANTICOAGULANT 
COMPLEX, 2 
ANTIFIBRINOLYTIC 
COMPLEX 


12 


ldx 
5 


1 


870 


981 


Lie 
-27 


0.06 


0.01 




THROMBI 
N LIGHT 
CHAIN; 
CHAIN: A, 
B, C, D; 
THROMBI 
N HEAVY 
CHAIN; 
CHAIN: 
M, N, O, P; 
THROMB 
OMODULI 
N; CHAIN: 
I.J, K, L; 
THROMBI 
N 

INHIBITO 

RL-GLU- 

L-GLY-L- 

ARM; 

CHAIN: E, 

F,G,H; 


SERINE PROTEINASE 
COAGULATION 
FACTOR II; 
COAGULATION 
FACTOR II; 
FETOMODULIN, TM, 
CD141 ANTIGEN; 
EGR-CMK SERINE 
PROTEINASE, EGF- 
LIKE DOMAINS, 
ANTICOAGULANT 
COMPLEX, 2 
ANTIFIBRINOLYTIC 
COMPLEX 


12 


ldx 
5 


I 


945 


105 

7 


Lie 
-29 


0.20 


0.21 




THROMBI 
N LIGHT 
CHAIN; 
CHAIN: A, 
B. C, D; 
THROMBI 
N HEAVY 
CHAIN; 
CHAIN: 
M, N, O, P; 
THROMB 
OMODULI 
N; CHAIN: 
I, J, K, L; 
THROMBI 
N 

INHIBITO 
R L-GLU- 
L-GLY-I^ 
ARM; 
CHAIN: E, 


SERINE PROTEINASE 
COAGULATION 
FACTOR II; 
COAGULATION 
FACTOR II; 
FETOMODULIN, TM, 
CD 141 ANTIGEN; 
EGR-CMK SERINE 
PROTEINASE, EGF- 
LIKE DOMAINS, 
ANTICOAGULANT 
COMPLEX, 2 
ANTIFIBRINOLYTIC 
COMPLEX 
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F, G, H; 




12 


ldx 
5 


I 


987 


109 
2 


1.9e 
-24 


0.14 


■0.06 




THROMBI 
N LIGHT 
CHAIN; 
CHAIN: A, 
B,C, D; 
THROMBI 
N HEAVY 
CHAIN; 
CHAIN: 
M, N, O, P; 
THROMB 
OMODULI 
N; CHAIN: 
I. IKU 
THROMBI 
N 

INHIBITO 

RL-GLU- 

L-GLY-L- 

ARM; 

CHAIN: E, 

F,G,H; 


SERINE PROTEINASE 
COAGULATION 
FACTOR II; 
COAGULATION 
FACTOR II; 
FETOMODULIN, TM, 
CD 141 ANTIGEN; 
EGR-CMK SERINE 
PROTEINASE, EGF- 
LIKE DOMAINS, 
ANTICOAGULANT 
COMPLEX, 2 
ANTIFIBRINOLYTIC 
COMPLEX 


12 


lhr 
e 




978 


103 
9 


9.5e 
-19 


0.17 


0.01 




GROWTH 

FACTOR 

HEREGUL 

IN-ALPHA 

(EPIDERM 

AL 

GROWTH 
FACTOR- 
LIKE 

DOMAIN) 
IHRE 3 
(NMR, 
MINIMIZE 
D 

AVERAGE 
STRUCTU 
RE) IHRE 
4 




12 


Hh 
b 


A 


1884 


205 
2 


7.6e 
-27 


0.48 


1.00 




CYCLIN- 

DEPENDE 

NT 

KINASE 6 
INHIBITO 

R; CHAIN: 
A, B; 


CELL CYCLE 
INHIBITOR PI 8- 
INK4C(INK6); CELL 
CYCLE INHIBITOR, 
P18-INK4C(INK6), 
ANKYRIN REPEAT, 2 
CDK 4/6 INHIBITOR 


I £, 


1 ill 

b 


A 
t\ 


i y j i 


208 
4 


9.5e 
-37 


0.33 


1.00 




CYCLIN- 

DEPENDE 

NT 

KINASE 6 
INHIBITO 
R; CHAIN: 
A.B; 


CELL CYCLE 
INHIBITOR PI 8- 
INK4C(INK6); CELL 
CYCLE INHIBITOR, 
P18-INK4C(INK6), 
ANKYRIN REPEAT, 2 
CDK 4/6 INHIBITOR 


12 


Ikl 

0 




1276 


143 
5 


5.7e 
-24 


0.17 


-0.01 




LAMININ; 

CHAIN: 

NULL; 


GLYCOPROTEIN 
GLYCOPROTEIN 



130 



WO 02/077180 PCTYUS02/08964 



SE 
Q 
ID 
NO 


PD 
B 
ID 


CHA 
IN 
ID 


STA 
RT 
AA 


EN 
D 

AA 


Psi 
Bla 

St 


Verif 

y 

score 


PMF 
score 


SEQFO 
LD 
score 


Coumpoun 
d 


PDB annotation 


12 


lkl 

0 




1316 


152 
4 


5.7e 
-19 


0.04 


-0.14 




LAM IN IN; 

CHAIN: 

NULL; 


GLYCOPROTEIN 
GLYCOPROTEIN 


12 


lkl 

0 




148 


312 


I.7e 
-31 


0.30 


-0.13 




LAM IN IN; 

CHAIN: 

NULL; 


GLYCOPROTEIN 
GLYCOPROTEIN 


12 


lkl 

0 




226 


429 


3.8e 
-30 


0.20 


0.01 




LAMININ; 

CHAIN: 

NULL; 


GLYCOPROTEIN 
GLYCOPROTEIN 


12 


lkl 

0 




306 


467 


1.9e 
-29 


0.09 


-0.06 




LAMININ; 

CHAIN: 

NULL; 


GLYCOPROTEIN 
GLYCOPROTEIN 


12 


lkl 

0 




31 


195 


3.8e 
-26 


0.24 


0.25 




LAMININ; 

CHAIN: 

NULL; 


GLYCOPROTEIN 
GLYCOPROTEIN 


12 


lkl 

0 




498 


694 


3.8e 
-32 


0.22 


0.35 




LAMININ; 

CHAIN: 

NULL; 


GLYCOPROTEIN 
GLYCOPROTEIN 


12 


lkl 

0 




686 


846 


1.7e 
-31 


0.41 


0.49 




LAMININ; 

CHAIN: 

NULL; 


GLYCOPROTEIN 
GLYCOPROTEIN 


12 


lkl 

0 




799 


961 


7.6e 
-32 


0.13 


0.05 




LAMININ; 

CHAIN: 

NULL; 


GLYCOPROTEIN 
GLYCOPROTEIN 


12 


lkl 

0 




916 


108 
5 


l.le 
-34 


0.28 


0.74 




LAMININ; 

CHAIN: 

NULL; 


GLYCOPROTEIN 
GLYCOPROTEIN 


12 


lm 
yo 




1884 


201 

3 


1.5e 
-23 


0.40 


1.00 




MYOTRO 
PHIN; 
CHAIN: 
NULL 


ANK-REPEAT 
MYOTROPHIN, 
ACETYLATION, NMR, 
ANK-REPEAT 


12 


lm 
yo 




1930 


203 
7 


3.8e 
-25 


0.14 


0.46 




MYOTRO 
PHIN; 
CHAIN: 
NULL 


ANK-REPEAT 
MYOTROPHIN, 
ACETYLATION, NMR, 
ANK-REPEAT 


12 


lm 
yo 




1998 


211 
2 


1.9e 
-32 


-0.08 


0.74 




MYOTRO 
PHIN; 
CHAIN: 
NULL 


ANK-REPEAT 
MYOTROPHIN, 
ACETYLATION, NMR, 
ANK-REPEAT 


12 


Infi 


E 


1881 


209 
8 


9.5e 
-50 


0.42 


1.00 




NF- 

KAPPA-B 
P65; 

CHAIN: A, 
C; NF- 
KAPPA-B 
P50; 

CHAIN: B, 
D; I- 

KAPPA-B- 
ALPHA; 
CHAIN: E, 
F; 


COMPLEX 
(TRANSCRIPTION 
REG/ANK REPEAT) 
COMPLEX 
(TRANSCRIPTION 
REGULATION/ ANK 
REPEAT), ANKYRJN 2 
REPEAT HELIX 


12 


lpf 

X 


L 


1066 


121 
0 


7.6e 
-39 


0.17 


0-00 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD 
COAGULATION/INHI 
BITOR) CHRISTMAS 
FACTOR; COMPLEX, 
INHIBITOR, 
HEMOPHILIA/EGF, 
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BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
HYDROLASE, 3 
GLYCOPROTEIN 


12 


lpf 

X 


L 


1150 


129 
7 


3.8e 
-41 


0.09 


-0.14 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) CHRISTMAS 

FACTOR; COMPLEX, 

INHIBITOR, 

H EMOPHILI A/EGF, 

BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
HYDROLASE, 3 
GLYCOPROTEIN 


12 


lpf 

X 


L 


1190 


133 

7 


1.9e 
-37 


0.20 


-0.05 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) CHRISTMAS 

FACTOR; COMPLEX, 

INHIBITOR, 

HEMOPHILIA/EGF, 

BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
HYDROLASE, 3 
GLYCOPROTEIN 


12 


lpf 

X 


L 


1236 


137 
5 


1.5e 
-37 


0.11 


-0.03 




FACTOR 
DCA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) CHRISTMAS 

FACTOR; COMPLEX, 

INHIBITOR, 

HEMOPHILIA/EGF, 

BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
HYDROLASE, 3 
GLYCOPROTEIN 


12 


lpf 

X 


L 


1274 


141 

7 


9.5e 
-34 


0.19 


0.00 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) CHRISTMAS 

FACTOR; COMPLEX, 

INHIBITOR, 

HEMOPHILIA/EGF, 

BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
HYDROLASE, 3 
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GLYCOPROTEIN 


12 


Ipf 

X 


L 


146 


283 


5.7e 
-41 


0.03 


-0.09 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) CHRISTMAS 

FACTOR; COMPLEX, 

INHIBITOR, 

HEMOPHILIA/EGF, 

BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
HYDROLASE, 3 
GLYCOPROTEIN 


12 


i P f 

X 


L 


189 


323 


1.9e 
-38 


0.28 


-0.12 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) CHRISTMAS 

FACTOR; COMPLEX, 

INHIBITOR, 

HEMOPHILIA/EGF, 

BLOOD 

COAGULATION, 2 
PLASMA, SERINE j 
PROTEASE, 

CALCIUM-BINDING, | 
HYDROLASE, 3 
GLYCOPROTEIN 


12 


Ipf 

X 


L 


222 


361 


5.7e 
-41 


0.25 


-0.06 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) CHRISTMAS 

FACTOR; COMPLEX, 

INHIBITOR, 

HEMOPHILIA/EGF, 

BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
HYDROLASE, 3 
GLYCOPROTEIN 


12 


Ipf 

X 


L 


378 


516 


5.7e 
-41 


-o.n 


0.03 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) CHRISTMAS 

FACTOR; COMPLEX, 

INHIBITOR, 

HEMOPHILIA/EGF, 

BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
HYDROLASE, 3 
GLYCOPROTEIN 


12 


Ipf 

X 


L 


458 


592 


3.8e 
-43 


-0.35 


0.09 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 


COMPLEX (BLOOD 
COAGULATION/INHI 
BITOR) CHRISTMAS 
FACTOR; COMPLEX, 
INHIBITOR, 
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CHAIN: I; 


HEMOPHILIA/EGF, 
BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
HYDROLASE, 3 
GLYCOPROTEIN 


12 


lpf 

X 


L 


65 


207 


1.9c 
-38 


0.20 


-0.13 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) CHRISTMAS 

FACTOR; COMPLEX, 

INHIBITOR, 

HEMOPHILIA/EGF, 

BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
HYDROLASE, 3 
GLYCOPROTEIN 


12 


ipf 

X 


L 


836 


972 


5.7e 
-41 


0.04 


0.01 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) CHRISTMAS 

FACTOR; COMPLEX, 

INHIBITOR, 

HEMOPHILIA/EGF, 

BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
HYDROLASE, 3 
GLYCOPROTEIN 


12 


lpf 

X 


L 


877 


101 
0 


1.9e 
-42 


0.27 


0.05 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) CHRISTMAS 

FACTOR; COMPLEX, 

INHIBITOR, 

HEMOPHILIA/EGF, 

BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
HYDROLASE, 3 
GLYCOPROTEIN 


12 


lpf 

X 


L 


915 


104 
8 


5.7e 
-41 


0.05 


-0.06 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD 

COAGULATION/INHI 

BITOR) CHRISTMAS 

FACTOR; COMPLEX, 

INHIBITOR, 

HEMOPHILIA/EGF, 

BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
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score 
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score 
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PDB annotation 






















HYDROLASE, 3 
GLYCOPROTEIN 


12 


lpf 

X 


L 


952 


108 

6 


3.8e 
-40 


0.17 


0.46 




FACTOR 
IXA; 

CHAIN: C, 
L,; D-PHE- 
PRO-ARG; 
CHAIN: I; 


COMPLEX (BLOOD | 

COAGULATION/INHI 

BITOR) CHRISTMAS 

FACTOR; COMPLEX, 

INHIBITOR, 

HEMOPHILIA/EGF, 

BLOOD 

COAGULATION, 2 
PLASMA, SERINE 
PROTEASE, 
CALCIUM-BINDING, 
HYDROLASE, 3 
GLYCOPROTEIN 


12 


lqf 
k 


L 


1025 


116 
1 


5.7e 
-24 


0.12 


-0.11 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 

COAGUL 

ATION 

FACTOR 

VIIA 

(HEAVY 

CHAIN); 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 

R; CHAIN: 
C; 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 


12 


lqf 
k 


L 


1117 


119 
9 


3.8e 
-26 


0.28 


0.25 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 

COAGUL 

ATION 

FACTOR 

VIIA 

(HEAVY 

CHAIN); 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 
C; 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 


12 


k 


L 


1149 


124 
6 


3.8e 
-28 


0.03 


0.15 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 
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PDB annotation 




















CHAIN); 

CHAIN: L; 

COAGUL 

ATION 

FACTOR 

VIIA 

(HEAVY 

CHAIN); 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 
C; 




12 


lqf 

k 


L 


1187 


128 
8 


1.9e 
-29 


0.11 


0.60 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 

COAGUL 

ATION 

FACTOR 

VIIA 

(HEAVY 

CHAIN); 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 
C; 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 


12 


lqf 
k 


L 


1312 


140 
4 


1.9e 
-21 


0.24 


0.05 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 

COAGUL 

ATION 

FACTOR 

VIIA 

(HEAVY 

CHAIN); 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 

C; 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 


12 


lqf 
k 


L 


181 


273 


1.3e 
-27 


0.56 


0.21 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 
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PDB annotation 




















CHAIN: L; 

COAGUL 

ATION 

FACTOR 

VIIA 

(HEAVY 

CHAIN); 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 
C; 




12 


lqf 
k 


L 


298 


387 


9.5c 
-26 


-0.13 


0.11 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 

COAGUL 

ATION 

FACTOR 

VIIA 

(HEAVY 

CHAIN); 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 


12 


lqf 
k 


L 


339 


467 


Lie 
-22 


0.05 


0.00 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 

COAGUL 

ATION 

FACTOR 

VIIA 

(HEAVY 

CHAIN); 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 

c; 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 


12 


lqf 
k 


L 


415 


508 


9.5e 
-29 


0.05 


0.27 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 
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PDB annotation 


n 

V 


f» 

0 


IN 


RT 


D 


Bla 


v 
J 


score 


LD 


d 




in 
it/ 


in 


in 
it/ 




AA 


st 


score 




score 






NO 








































COAGUL 






















ATION 






















FACTOR 






















VIIA 






















(HEAVY 






















CHAIN); 






















CHAIN: H; 






















TRIPEPTI 






















DYL 






















INHIBITO 






















R; CHAIN: 






















c; 




12 


lqf 
k 


L 


493 


592 


3.8e 
-29 


-0.05 


0.78 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 

COAGUL 

ATION 

FACTOR 

VIIA 

(HEAVY 

CHAIN); 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 
C; 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 


12 


lqf 
k 


L 


531 


618 


9.5e 
-30 


0.46 ; 


0.37 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 

COAGUL 

ATION 

FACTOR 

VIIA 

(HEAVY 

CHAIN); 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 
C; 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 


12 


lqf 
k 


L 


63 


155 


1.7e 
-23 


0.39 


0.11 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 

COAGUL 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 
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ATION 

FACTOR 

VIIA 

(HEAVY 

CHAINV 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 
Cj 




12 


lqf 
k 


L 


756 


846 


1.7e 
-29 


0.35 


0.78 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 

COAGUL 

ATION 

FACTOR 

VIIA 

(HEAVY 

CHAINV 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 
C; 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 


12 


lqf 

k i 


L 


912 


100 
0 


3.8e 
-31 


-0.02 


0.47 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 

COAGUL 

ATION 

FACTOR 

VIIA 

(HEAVY 

CHAIN); 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 

c; 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 


12 


lqf 
k 


L 


949 


103 
7 


3.8e 
-31 


0.20 


0.13 | 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 

COAGUL 

ATION I 


SERINE PROTEASE 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 
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FACTOR 

VIIA 

(HEAVY 

CHAIN); 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 

C; 




12 


Iqf 
k 


L 


988 


107 
5 


3.8e 
-30 


0.05 


0.72 




COAGUL 

ATION 

FACTOR 

VIIA 

(LIGHT 

CHAIN); 

CHAIN: L; 

COAGUL 

ATION 

FACTOR 

VIIA 

(HEAVY 

CHAINV 

CHAIN: H; 

TRIPEPTI 

DYL 

INHIBITO 
R; CHAIN: 
C; 


SERINE PROTEASE j 
FVIIA; FVIIA; BLOOD 
COAGULATION, 
SERINE PROTEASE 


12 


Isw 
6 


A 


1874 


207 
0 


3.8e 
-26 


0.05 


0.72 




REGULAT 

ORY 

PROTEIN 

SWI6; 

CHAIN: A, 

B; 


TRANSCRIPTION 

REGULATION 

TRANSCRIPTION 

REGULATION, 

ANKYRIN REPEATS, 

CELL-CYCLE 


12 


lur 
k 




1312 


145 
8 


3.8e 
-15 


0.13 


-0.14 




PLASMIN 
OGEN 
ACnVATI 
ON 

PLASMIN 
OGEN 
ACTIVAT 
OR 

(UROKIN 

ASE- 

TYPE) 

(AMINO 

TERMINA 

L 

FRAGME 
NT) (NMR, 
15 

STRUCTU 
RES) 




12 


lur 

k ! 




757 


920 


5.7e 
-17 


0.07 


-0.17 




PLASMIN 
OGEN 
ACTIVATI 
ON 





140 



WO 02/077180 PCT/US02/08964 



SE 

Q 

ID 

NO 


PD 
B 
ID 


CHA 
IN 
ID 


STA 
RT 
AA 


EN 
D 

AA 


Psi 
Bla 

St 


Verif 

y 

score 


PMF 

score 


SEQFO 
LD 
score 
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PLASMIN 
OGEN 
ACTIVAT 
OR 

(UROKIN 

ASE- 

TYPE) 

(AMINO 

TERM FN A 

L 

FRAGME 
NT) (NMR, 
15 

STRUCTU 
RES) 




12 


lva 
P 


A 


1059 


118 
8 


1.9e 
-30 


0.08 


-0.09 




PHOSPHO 

LIPASE 

A2; 

CHAIN: A, 
B; 


LIPID DEGRADATION 
PHOSPHOLIPASE A2, 
LIPID 

DEGRADATION, 
HYDROLASE 


12 


lva 
P 


A 


144 


261 


1.7e 
-29 


0.16 


-0.12 




PHOSPHO 

LIPASE 

A2; 

CHAIN: A, 
B; 


LIPID DEGRADATION 
PHOSPHOLIPASE A2, 
LIPID 

DEGRADATION, 
HYDROLASE 


12 


lva 
P 


A 


306 


456 


7.6e 
-28 


0.01 


-0.14 




PHOSPHO 

LIPASE 

A2; 

CHAIN: A, 
B; 


LIPID DEGRADATION 
PHOSPHOLIPASE A2, 
LIPID 

DEGRADATION, 
HYDROLASE 


12 


lva 
P 


A 


31 


144 


5.7e 
-23 


0.04 


-0.13 




PHOSPHO 

LIPASE 

A2; 

CHAIN: A, 
B; 


LIPID DEGRADATION 
PHOSPHOLIPASE A2, 
LIPID 

DEGRADATION, 
HYDROLASE 


12 


lva 
P 


A 


687 


795 


Lie 
-28 


0.00 


-0.12 




PHOSPHO 
LIPASE 

A2; 

CHAIN: A, 
B; 


LIPID DEGRADATION 
PHOSPHOLIPASE A2, 
LIPID 

DEGRADATION, 1 
HYDROLASE 


12 


lvp 

I 




106 


222 


9.5e 
-30 


0.02 


-0.18 




PHOSPHO 

LIPASE 

A2 

INHIBITO 
R; CHAIN: 
NULL 


NEUROTOXIN 
PHOSPHOLIPASE A2 
INHIBITOR, X-RAY 
STRUCTURE, 
RECOGNITION, 2 
MOLECULAR 
EVOLUTION, 
NEUROTOXIN 


12 


lvp 




1238 


135 
3 


9.5e 
-30 


0.22 


-0.17 




PHOSPHO 

LIPASE 

A2 

INHIBITO 
R; CHAIN: 
NULL 


NEUROTOXIN 
PHOSPHOLIPASE A2 
INHIBITOR, X-RAY 
STRUCTURE, 
RECOGNITION, 2 
MOLECULAR 
EVOLUTION, 
NEUROTOXIN 


12 


lvp 

i 




988 


no 

2 


3.8e 
-31 


0.13 


-0.17 




PHOSPHO 
LIPASE 


NEUROTOXIN 
PHOSPHOLIPASE A2 
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A2 

INHIBITO 
R; CHAIN: 
NULL 


INHIBITOR, X-RAY 

STRUCTURE, 

RECOGNITION, 2 

MOLECULAR 

EVOLUTION, 

NEUROTOXIN 


1 9 


1 Wil 

e 




ojo 


908 


1 7p 

1 • / c 

-16 




-O OS 

*V.UJ 




COAGUL 
ATION 
FACTOR 
X; CHAIN: 
NULL; 


GLYCOPROTEIN 
GLYCOPROTEIN, 
HYDROLASE, SERINE 
PROTEASE, PLASMA, 
BLOOD 2 
COAGULATION 
FACTOR 


1 X 


1 wh 

I WIl 

e 






106 
1 


1.9e 
-19 


0.36 


-0.08 




COAGUL 

V-/ jV \-J V_/ 

ATION 
FACTOR 
X; CHAIN: 
NULL; 


G LYCOPROTEIN 
GLYCOPROTEIN, 
HYDROLASE, SERINE 
PROTEASE, PLASMA, 
BLOOD 2 
COAGULATION 
FACTOR 


12 


Ixk 
a 


L 


1149 


124 
8 


9.5e 
-25 


0.29 


0.36 




BLOOD 

COAGUL 

ATION 

FACTOR 

XA; 

CHAIN: L, 
C; 


BLOOD 

COAGULATION 
FACTOR STUART 
FACTOR- BLOOD 
COAGULATION 
FACTOR, SERINE 
PROTEINASE, 
EPIDERMAL 2 
GROWTH FACTOR 
LIKE DOMAIN 


12 


Ixk 
a 


L 


1312 


140 
4 


3.8e 
-18 


0.29 


0.13 




BLOOD 

COAGUL 

ATION 

FACTOR 

XA; 

CHAIN: L, 
C; 


BLOOD 

COAGULATION 
FACTOR STUART 
FACTOR* BLOOD 

k i\ V-^ A AVA^V^V^A^ 

COAGULATION 
FACTOR, SERINE 
PROTEINASE, 
EPIDERMAL 2 
GROWTH FACTOR 
LIKE DOMAIN 


12 


Ixk 
a 


L 


144 


233 


3.8e 
-25 


0.38 


0.90 




BLOOD 

COAGUL 

ATION 

FACTOR 

XA; 

CHAIN: L, 
C; 


BLOOD 

COAGULATION 
FACTOR STUART 
FACTOR- BLOOD 
COAGULATION 
FACTOR, SERINE 
PROTEINASE, 
EPIDERMAL 2 
GROWTH FACTOR 
LIKE DOMAIN 


12 


Ixk 


L 


756 


844 


1.3e 
-26 


0.33 


0.48 




BLOOD 

COAGUL 

ATION 

FACTOR 

XA; 

CHAIN: L, 
C; 


BLOOD 

COAGULATION 
FACTOR STUART 
FACTOR; BLOOD 
COAGULATION 
FACTOR, SERINE 
PROTEINASE, 
EPIDERMAL 2 
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ID 
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score 
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score 


LD 

score 


f'rt i i m w% mi n 

Luumpuuu 
d 


pnn onnrttotinn 






















GROWTH FACTOR 
LIKE DOMAIN 


12 


lxk 
a 


L 


833 


925 


1.9e 
-24 


0.24 


0.53 




BLOOD 

COAGUL 

ATION 

XA; 

CHAIN: L, 
C; 


BLOOD 

COAGULATION 
FACTOR STUART 

COAGULATION 
FACTOR, SERINE 
PROTEINASE, 
EPIDERMAL 2 
GROWTH FACTOR 
LIKE DOMAIN 


12 


lxk 

a 


L 


949 


103 
9 


3.8e 
-28 


0.15 


0.99 




BLOOD 

COAGUL 

ATION 

PAPTHD 

r AL, l UK 
XA; 

CHAIN: L, 

C; 


BLOOD 

COAGULATION 
FACTOR STUART 

COAGULATION 
FACTOR, SERINE 
PROTEINASE, 
EPIDERMAL 2 
GROWTH FACTOR 
LIKE DOMAIN 


12 


lyc 

s 


B 


1931 


209 
3 


1.9e 
-27 


-0.17 


0.64 




P53; 

CHAIN: A; 
53BP2; 
CHAIN: B; 


COMPLEX (ANTI- 
ONCOGENE/ANKYRI 
N REPEATS) P53BP2; 
ANKYRIN REPEATS, 
SH3,P53, TUMOR 
SUPPRESSOR, 
MULTIGENE 2 
FAMILY, NUCLEAR 

rt\\J 1 CI IN , 

PHOSPHORYLATION, 
DISEASE MUTATION, 
3 POLYMORPHISM, 
COMPLEX (ANTI- 
ONCOGENE/ANKYRI 
N REPEATS) 


12 


lyc 
s 


B 


1998 


211 
4 


1.7e 
-28 


0.10 


0.87 




P53; 

CHAIN: A; 
53BP2; 
CHAIN: B; 


COMPLEX (ANTI- 
ONCOGENE/ANKYRI 
N REPEATS) P53BP2; 
ANKYRIN REPEATS, 
SH3, P53, TUMOR 
SUPPRESSOR, 
MULTIGENE 2 
FAMILY, NUCLEAR 

rKKJ 1C1IN, 

PHOSPHORYLATION, 
DISEASE MUTATION, 
3 POLYMORPHISM, 
COMPLEX (ANTI- 
ONCOGEN E/ANKYRI 
N REPEATS "1 


12 


2no 
t 


A 


1117 


121 
9 


5.7e 
-31 


0.05 


-0.11 




PHOSPHO 

LIPASE 

A2; 

CHAIN: A, 
B; 


HYDROLASE 
HYDROLASE, LIPID 
DEGRADATION, 
CALCIUM, 
PRESYNAPTIC 2 
NEUROTOXIN, 
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VENOM 


1 7 


t 


A 


529 


638 


5.7e 
-30 


0.16 


-0.1 1 




PHOSPHO 

LIPASE 

A2; 

CHAIN: A, 
B; 


HYDROLASE 

HYDROLASE, LIPID 

DEGRADATION, 

CALCIUM, 

PRESYNAPTIC 2 

NEUROTOXIN, 

VENOM 


1 ? 


2no 
t 


A 

A 


U 1 


i i\j 


1 7e 

-25 


-ft AC) 


0.06 




PHOSPHO 

LIPASE 

A2; 

CHAIN: A, 
B; 


HYDROLASE 

HYDROLASE, LIPID ! 

DEGRADATION, 

CALCIUM, 

PRESYNAPTIC 2 

NEUROTOXIN, 

VENOM 


I A, 


2no 
t 


A 
t\ 


677 


789 


3.8e 
-32 


-0.15 


0.01 




PHOSPHO 

LIPASE 

A2; 

CHAIN: A, 
B; 


HYDROLASE 

HYDROLASE, LIPID 

DEGRADATION, 

CALCIUM, 

PRESYNAPTIC 2 

NEUROTOXIN, 

VENOM 


1 9 
XL 


2no 
t 


A 




5 


5.7e 
-30 


0.26 


-0.09 




PHOSPHO 

LIPASE 

A2; 

CHAIN: A, 
B; 


HYDROLASE 

HYDROLASE, LIPID 

DEGRADATION, 

CALCIUM, 

PRESYNAPTIC 2 

NEUROTOXIN, 

VENOM 


12 


9wg 
a 


A 


1045 


125 
7 


1.9e 
-40 


0.13 


-0.12 




LECTIN 

(AGGLUTI 

NIN) 

WHEAT 

GERM 

AGGLUTI 

NIN 

(ISOLECTI 

N2) 

9WGA3 




12 


9wg 
a 


A 


1186 


137 
5 
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CLAIMS 

WHAT IS CLAIMED IS: 

1. An isolated polynucleotide comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1 - 1 1, a mature protein coding portion of SEQ ID NO: 1 - 1 1, an 

5 active domain coding protein of SEQ ID NO: 1 - 1 1 , and complementary sequences thereof. 

2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein 
said polynucleotide has greater than about 90% sequence identity with the polynucleotide of 
claim 1 . 

0 

3. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 



4. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

15 

5. A vector comprising the polynucleotide of claim 1 . 

6. An expression vector comprising the polynucleotide of claim 1 . 

20 7. A host cell genetically engineered to comprise the polynucleotide of claim 1 . 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 
operatively associated with a regulatory sequence that modulates expression of the 
polynucleotide in the host cell. 

25 

9. An isolated polypeptide, wherein the polypeptide is selected from the group consisting of 
a polypeptide encoded by any one of the polynucleotides of claim 1 . 

10. A composition comprising the polypeptide of claim 9 and a carrier. 

30 

11. An antibody directed against the polypeptide of claim 9. 



12. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 
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a) contacting the sample with a compound that binds to and forms a 
complex with the polynucleotide of claim 1 for a period sufficient to form the complex; and 

b) detecting the complex, so that if a complex is detected, the 
polynucleotide of claim 1 is detected. 

5 

13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with 
nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions; 

1 0 b) amplifying a product comprising at least a portion of the polynucleotide 

of claim 1 ; and 

c) detecting said product and thereby the polynucleotide of claim 1 in the 

sample. 

15 14. The method of claim 13, wherein the polynucleotide is an RNA molecule and the 
method further comprises reverse transcribing an annealed RNA molecule into a cDNA 
polynucleotide. 

15. A method for detecting the polypeptide of claim 9 in a sample, comprising: 

20 a) contacting the sample with a compound that binds to and forms a 

complex with the polypeptide under conditions and for a period sufficient to form the 
complex; and 

b) detecting formation of the complex, so that if a complex formation is 
detected, the polypeptide of claim 9 is detected. 

25 

16. A method for identifying a compound that binds to the polypeptide of claim 9, 
comprising: 

a) contacting the compound with the polypeptide of claim 9 under 
conditions sufficient to form a polypeptide/compound complex; and 
30 b) detecting the complex, so that if the polypeptide/compound complex is 

detected, a compound that binds to the polypeptide of claim 9 is identified. 



17. A method for identifying a compound that binds to the polypeptide of claim 9, 
comprising: 
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a) contacting the compound with the polypeptide of claim 9, in a cell, 
under conditions sufficient to form a polypeptide/compound complex, wherein the complex 
drives expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence expression, 
5 so that if the polypeptide/compound complex is detected, a compound that binds to the 

polypeptide of claim 9 is identified. 

18. A method of producing the polypeptide of claim 9, comprising, 

a) culturing a host cell comprising a polynucleotide sequence selected 
10 from the group consisting of a polynucleotide sequence of SEQ ID NO: 1-1 1, a mature protein 

coding portion of SEQ ID NO: 1-1 1, an active domain coding portion of SEQ ID NO: 1-1 1, 
complementary sequences thereof, under conditions sufficient to express the polypeptide in 
said cell; and 

b) isolating the polypeptide from the cell culture or cells of step (a). 

15 

19. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of any one of the polypeptides from the Sequence Listing, the mature protein 
portion thereof, or the active domain thereof. 

20 20. The polypeptide of claim 21 wherein the polypeptide is provided on a polypeptide 
array. 

21. A collection of polynucleotides, wherein the collection comprising the sequence 
information of at least one of SEQ ID NO: 1-11. 

25 

22. The collection of claim 21, wherein the collection is provided on a nucleic acid array. 

23. The collection of claim 22, wherein the array detects full-matches to any one of the 
polynucleotides in the collection. 

30 24. The collection of claim 22, wherein the array detects mismatches to any one of the 
polynucleotides in the collection. 

25. The collection of claim 21, wherein the collection is provided in a computer-readable 
format. 
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26. A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising a polypeptide of claim 9 or 19 and a 
pharmaceutical^ acceptable carrier. 

27. A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising an antibody that specifically binds 
to a polypeptide of claim 9 or 19 and a pharmaceutical^ acceptable carrier. 
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25 <151> 2001-03-22 

<160> 11 

<170> Patentln version 3.0 

30 



<210> 1 
<211> 7673 
35 <212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
40 <222> (1) . . (7671) 

<400> 1 

atg ccg ccg etc ctg gcg ccc ctg etc tgc ctg gcg ctg ctg ccc 
gcg 48 

45 Met Pro Pro Leu Leu Ala Pro Leu Leu Cys Leu Ala Leu Leu Pro 
Ala 

15 10 15 



etc gec gca cga ggc ccg cga tgc tec cag ccc ggt gag ace tgc 
50 ctg 96 

Leu Ala Ala Arg Gly Pro Arg Cys Ser Gin Pro Gly Glu Thr Cys 
Leu 
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20 

aat ggc ggg aag 
tgt 144 
5 Asn Gly Gly Lys 
Cys 

35 

ggc ggg gcc ttc 
10 etc 192 

Gly Gly Ala Phe 
Leu 

50 

15 age acc ccc tgc 
aga 240 

Ser Thr Pro Cys 
Arg 
65 

20 80 

ggc gtg gca gac 

ccc 288 

Gly Val Ala Asp 
25 Pro 



etc tgc ctg aca 
cgc 336 
30 Leu Cys Leu Thr 
Arg 

100 

aac ggg ggc acc 
35 cgc 384 

Asn Gly Gly Thr 
Arg 

115 

40 tgc ccg ccc ggc 
tgc 432 

Cys Pro Pro Gly 
Cys 

130 

45 

gcc tec aac ccc 
gcc 480 

Ala Ser Asn Pro 
Ala 
50 145 
160 



25 

tgt gaa gcg gcc aat 
Cys Glu Ala Ala Asn 
40 

gtg ggc ccg cga tgc 
Val Gly Pro Arg Cys 
55 

aag aac gcc ggg aca 
Lys Asn Ala Gly Thr 
70 

tat gcc tgc age tgt 
Tyr Ala Cys Ser Cys 
85 

ccc ctg gac aac gcc 
Pro Leu Asp Asn Ala 
105 

tgc gac ctg etc acg 
Cys Asp Leu Leu Thr 
120 

tgg tea ggg aaa teg 
Trp Ser Gly Lys Ser 
135 

tgc gcc aac ggt ggc 
Cys Ala Asn Gly Gly 
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30 

ggc acg gag gcc tgc gtc 
Gly Thr Glu Ala Cys Val 
45 

cag gac ccc aac ccg tgc 
Gin Asp Pro Asn Pro Cys 
60 

tgc cac gtg gtg gac cgc 
Cys His Val Val Asp Arg 
75 

gcc ctg ggc ttc tct ggg 
Ala Leu Gly Phe Ser Gly 
90 95 
tgc etc acc aac ccc tgc 
Cys Leu Thr Asn Pro Cys 
110 

ctg acg gag tac aag tgc 
Leu Thr Glu Tyr Lys Cys 
125 

tgc cag cag get gac ccg 
Cys Gin Gin Ala Asp Pro 
140 

cag tgc ctg ccc ttc gag 
Gin Cys Leu Pro Phe Glu 
155 
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tec tac ate tgc 
egg 528 

Ser Tyr lie Cys 
Arg 

5 

cag gat gtc aac 
gga 576 
Gin Asp Val Asn 
10 Gly 

180 

ggc ace tgc cac 
gee 624 
15 Gly Thr Cys His 
Ala 

195 

ace cac act ggc 
20 ccc 672 

Thr His Thr Gly 
Pro 

210 

25 teg ccc tgc cag 
ace 720 

Ser Pro Cys Gin 
Thr 
225 
30 240 

cac gag tgt gee 

gaa 768 

His Glu Cys Ala 
35 Glu 



aat ate gac gat 
tgt 816 
40 Asn lie Asp Asp 
Cys 

260 

gtg gac ggc gtg 
45 aca 864 

Val Asp Gly Val 
Thr 

275 

50 ggt cag tac tgt 
aat 912 



cac tgc cca ccc age 
His Cys Pro Pro Ser 
165 

gag tgt ggc cag aag 
Glu Cys Gly Gin Lys 
185 

aac gag gtc ggc tec 
Asn Glu Val Gly Ser 
200 

ccc aac tgc gag egg 
Pro Asn Cys Glu Arg 
215 

aac ggg ggc ace tgc 
Asn Gly Gly Thr Cys 
230 

tgc ctg cca ggc ttc 
Cys Leu Pro Gly Phe 
245 

tgt cca gga aac aac 
Cys Pro Gly Asn Asn 
265 

aac ace tac aac tgc 
Asn Thr Tyr Asn Cys 
280 

ace gag gat gtg gac 
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ttc cat ggc ccc ace tgc 

Phe His Gly Pro Thr Cys 

170 175 

ccc agg ctt tgc cgc cac 

Pro Arg Leu Cys Arg His 

190 

tac cgc tgc gtc tgc cgc 
Tyr Arg Cys Val Cys Arg 
205 

ccc tac gtg ccc tgc age 
Pro Tyr Val Pro Cys Ser 
220 

cgc ccc acg ggc gac gtc 
Arg Pro Thr Gly Asp Val 
235 

ace ggc cag aac tgt gag 
Thr Gly Gin Asn Cys Glu 
250 255 
tgc aag aac ggg ggt gee 
Cys Lys Asn Gly Gly Ala 
270 

ccg tgc ccg cca gag tgg 
Pro Cys Pro Pro Glu Trp 
285 

gag tgc cag ctg atg cca 



WO 02/077180 

Gly Gin Tyr Cys 
Asn 

290 

5 gcc tgc cag aac 
aac 960 

Ala Cys Gin Asn 
Asn 
305 
10 320 

tgc gtg tgt gtc 
att 1008 
Cys Val Cys Val 
15 He 



gat • gac tgt gcc 
gac 1056 
20 Asp Asp Cys Ala 
Asp 

340 

cgt gtg gcc tec 
25 ctg 1104 

Arg Val Ala Ser 
Leu 

355 

30 ctg tgc cac etc 
ggc 1152 

Leu Cys His Leu 
Gly 

370 

35 

tec aac tgc gac 
tgc 1200 

Ser Asn Cys Asp 
Cys 
40 385 
400 

ccc teg ggg tac 
tgc 1248 
45 Pro Ser Gly Tyr 
Cys 



teg ctg ggt gcc 
50 acg 1296 

Ser Leu Gly Ala 
Thr 



Thr Glu Asp Val Asp 
295 

ggc ggg acc tgc cac 
Gly Gly Thr Cys His 
310 

aac ggc tgg act ggt 
Asn Gly Trp Thr Gly 
325 

age gcc gcc tgc ttc 
Ser Ala Ala Cys Phe 
345 

ttt tac tgc gag tgt 
Phe Tyr Cys Glu Cys 
360 

aac gac gca tgc ate 
Asn Asp Ala Cys He 
375 

acc aac cct gtc aat 
Thr Asn Pro Val Asn 
390 

acg ggc ccg gcc tgc 
Thr Gly Pro Ala Cys 
405 

aac ccc tgc gag cat 
Asn Pro Cys Glu His 
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Glu Cys Gin Leu Met Pro 

300 

aac acc cac ggt ggc tac 
Asn Thr His Gly Gly Tyr 
315 

gag gac tgc age gag aac 
Glu Asp Cys Ser Glu Asn 
330 335 
cac ggc gcc acc tgc cat 
His Gly Ala Thr Cys His 
350 

ccc cat ggc cgc aca ggt 
Pro His Gly Arg Thr Gly 
365 

age aac ccc tgt aac gag 
Ser Asn Pro Cys Asn Glu 
380 

ggc aag gcc ate tgc acc 
Gly Lys Ala He Cys Thr 
395 

age cag gac gtg gat gag 
Ser Gin Asp Val Asp Glu 
410 415 
gcg ggc aag tgc ate aac 
Ala Gly Lys Cys He Asn 
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420 

ctg ggc tec ttc 
cga 1344 
5 Leu Gly Ser Phe 
Arg 



tgc gag ate gac 
10 gac 1392 

Cys Glu lie Asp 
Asp 

450 

15 gec acc tgc ctg 
ccc 1440 

Ala Thr Cys Leu 
Pro 
465 
20 480 

ggc tac gag ggt 
age 1488 
Gly Tyr Glu Gly 
25 Ser 



age ccc tgc ctg 
ttc 1536 
30 Ser Pro Cys Leu 
Phe 

500 

cag tgc gag tgc 
35 gat 1584 

Gin Cys Glu Cys 
Asp 

515 

40 gtg gac gag tgt 
ctg 1632 

Val Asp Glu Cys 
Leu 

530 

45 

gac gga ccc aac 
ggg 1680 

Asp Gly Pro Asn 
Gly 
50 545 
560 



425 

gag tgc cag tgt ctg 
Glu Cys Gin Cys Leu 
440 

gtc aac gag tgc gtc 
Val Asn Glu Cys Val 
455 

gac cag att ggg gag 
Asp Gin lie Gly Glu 
470 

gtg cac tgc gag gtc 
Val His Cys Glu Val 
485 

cac aat ggc cgc tgc 
His Asn Gly Arg Cys 
505 

ccc acg ggc ttc act 
Pro Thr Gly Phe Thr 
520 

gec age acc ccc tgc 
Ala Ser Thr Pro Cys 
535 

act tac acc tgt gtg 
Thr Tyr Thr Cys Val 
550 
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430 

cag ggc tac acg ggc ccc 
Gin Gly Tyr Thr Gly Pro 
445 

teg aac ccg tgc cag aac 
Ser Asn Pro Cys Gin Asn 
460 

ttc cag tgc atg tgc atg 
Phe Gin Cys Met Cys Met 
475 

aac aca gac gag tgt gec 
Asn Thr Asp Glu Cys Ala 
490 495 
ctg gac aag ate aat gag 
Leu Asp Lys lie Asn Glu 
510 

ggg cat ctg tgc cag tac 
Gly His Leu Cys Gin Tyr 
525 

aag aat ggt gee aag tgc 
Lys Asn Gly Ala Lys Cys 
540 

tgc acg gaa ggg tac acg 
Cys Thr Glu Gly Tyr Thr 
555 



5 
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acg cac tgc gag gtg gac ate gat gag tgc gac ccc gac ccc tgc 
cac 1728 

Thr His Cys Glu Val Asp lie Asp Glu Cys Asp Pro Asp Pro Cys 
His 

5 565 570 575 

tac ggc tec tgc aag gac ggc gtc gee ace ttc acc tgc etc tgc 
cgc 1776 

Tyr Gly Ser Cys Lys Asp Gly Val Ala Thr Phe Thr Cys Leu Cys 
10 Arg 

580 585 590 

cca ggc tac acg ggc cac cac tgc gag acc aac ate aac gag tgc 
tec 1824 

15 Pro Gly Tyr Thr Gly His His Cys Glu Thr Asn lie Asn Glu Cys 
Ser 

595 600 605 

age cag ccc tgc cgc eta egg ggc acc tgc cag gac ccg gac aac 
20 gee 1872 

Ser Gin Pro Cys Arg Leu Arg Gly Thr Cys Gin Asp Pro Asp Asn 
Ala 

610 615 620 

25 tac etc tgc ttc tgc ctg aag ggg acc aca gga ccc aac tgc gag 
ate 1920 

Tyr Leu Cys Phe Cys Leu Lys Gly Thr Thr Gly Pro Asn Cys Glu 
He 

625 630 635 

30 640 

aac ctg gat gac tgt gee age age ccc tgc gac teg ggc acc tgt 
ctg 1968 

Asn Leu Asp Asp Cys Ala Ser Ser Pro Cys Asp Ser Gly Thr Cys 
35 Leu 

645 650 655 

gac aag ate gat ggc tac gag tgt gee tgt gag ccg ggc tac aca 
ggg 2016 

40 Asp Lys He Asp Gly Tyr Glu Cys Ala Cys Glu Pro Gly Tyr Thr 
Gly 

660 665 670 

age atg tgt aac age aac ate gat gag tgt gcg ggc aac ccc tgc 
45 cac 2064 

Ser Met Cys Asn Ser Asn He Asp Glu Cys Ala Gly Asn Pro Cys 
His 

675 680 685 

50 aac ggg ggc acc tgc gag gac ggc ate aat ggc ttc acc tgc cgc 
tgc 2112 
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Asn Gly Gly 
Cys 

690 

5 ccc gag ggc 
tgc 2160 
Pro Glu Gly 
Cys 
705 
10 720 

aac age aac 
ggg 2208 
Asn Ser Asn 
15 Gly 



tac aag tgc 

ate 2256 

20 Tyr Lys Cys 
lie 



aac aac aac 
25 tgc 2304 
Asn Asn Asn 
Cys 

755 

30 aaa gac atg 
age 2352 
Lys Asp Met 
Ser 

770 

35 

ggt ccc aac 
tgt 2400 

Gly Pro Asn 
Cys 
40 785 
800 

ctg aac aag 
aac 2448 
45 Leu Asn Lys 
Asn 



tgc ctg ctg 
50 ccg 2496 
Cys Leu Leu 
Pro 



Thr Cys Glu Asp 
695 

tac cac gac ccc 
Tyr His Asp Pro 
710 

ccc tgc gtc cac 
Pro Cys Val His 
725 

gac tgt gac cct 
Asp Cys Asp Pro 
740 

gag tgt gaa tec 
Glu Cys Glu Ser 

ace agt ggc ate 
Thr Ser Gly He 
775 

tgc cag acc aac 
Cys Gin Thr Asn 
790 

ggc acg tgt att 
Gly Thr Cys He 
805 

ccc tac aca ggt 
Pro Tyr Thr Gly 



Gly He Asn Gly 

acc tgc ctg tct 
Thr Cys Leu Ser 
715 

ggg gee tgc egg 
Gly Ala Cys Arg 
730 

ggg tgg agt ggg 
Gly Trp Ser Gly 
745 

aac cct tgt gtc 
Asn Pro Cys Val 
760 

gtg tgc acc tgc 
Val Cys Thr Cys 

ate aac gag tgt 
He Asn Glu Cys 
795 

gac gac gtt gee 
Asp Asp Val Ala 
810 

gee acg tgt gag 
Ala Thr Cys Glu 
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Phe Thr Cys Arg 

700 

gag gtc aat gag 
Glu Val Asn Glu 

gac age etc aac 
Asp Ser Leu Asn 
735 

acc aac tgt gac 
Thr Asn Cys Asp 
750 

aac ggc ggc acc 
Asn Gly Gly Thr 
765 

egg gag ggc ttc 
Arg Glu Gly Phe 
780 

gcg tec aac cca 
Ala Ser Asn Pro 

ggg tac aag tgc 
Gly Tyr Lys Cys 
815 

gtg gtg ctg gee 
Val Val Leu Ala 
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820 825 830 

tgt gcc ccc age ccc tgc aga aac ggc ggg gag tgc agg caa tec 
gag 2544 

5 Cys Ala Pro Ser Pro Cys Arg Asn Gly Gly Glu Cys Arg Gin Ser 
Glu 

835 840 845 

gac tat gag age ttc tec tgt gtc tgc ccc acg get ggg gcc aaa 
10 ggg 2592 

Asp Tyr Glu Ser Phe Ser Cys Val Cys Pro Thr Ala Gly Ala Lys 
Gly 

850 855 860 

15 cag acc tgt gag gtc gac ate aac gag tgc gtt ctg age ccg tgc 
egg 2640 

Gin Thr Cys Glu Val Asp lie Asn Glu Cys Val Leu Ser Pro Cys 
Arg 

865 870 875 

20 880 

cac ggc gca tec tgc cag aac acc cac ggc ggc tac cgc tgc cac 
tgc 2688 

His Gly Ala Ser Cys Gin Asn Thr His Gly Gly Tyr Arg Cys His 
25 Cys 

885 890 895 

cag gcc ggc tac agt ggg cgc aac tgc gag acc gac ate gac gac 
tgc 2736 

30 Gin Ala Gly Tyr Ser Gly Arg Asn Cys Glu Thr Asp lie Asp Asp 
Cys 

900 905 910 

egg ccc aac ccg tgt cac aac ggg ggc tec tgc aca gac ggc ate 
35 aac 2784 

Arg Pro Asn Pro Cys His Asn Gly Gly Ser Cys Thr Asp Gly lie 
Asn 

915 920 925 

40 acg gcc ttc tgc gac tgc ctg ccc ggc ttc egg ggc act ttc tgt 
gag 2832 

Thr Ala Phe Cys Asp Cys Leu Pro Gly Phe Arg Gly Thr Phe Cys 
Glu 

930 935 940 

45 

gag gac ate aac gag tgt gcc agt gac ccc tgc cgc aac ggg gcc 
aac 2880 

Glu Asp lie Asn Glu Cys Ala Ser Asp Pro Cys Arg Asn Gly Ala 
Asn 

50 945 950 955 

960 
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tgc acg gac tgc gtg gac age tac acg tgc acc tgc ccc gca ggc 
ttc 2928 

Cys Thr Asp Cys Val Asp Ser Tyr Thr Cys Thr Cys Pro Ala Gly 
Phe 

5 965 970 975 

age ggg ate cac tgt gag aac aac acg cct gac tgc aca gag age 
tec 2976 

Ser Gly lie His Cys Glu Asn Asn Thr Pro Asp Cys Thr Glu Ser 
10 Ser 

980 985 990 

tgc ttc aac ggt ggc acc tgc gtg gac ggc ate aac teg ttc acc 
tgc 3024 

15 Cys Phe Asn Gly Gly Thr Cys Val Asp Gly lie Asn Ser Phe Thr 
Cys 

995 1000 1005 

ctg tgt cca ccc ggc ttc acg ggc age tac tgc cag cac gta gtc 
20 aat 3072 

Leu Cys Pro Pro Gly Phe Thr Gly Ser Tyr Cys Gin His Val Val 
Asn 

1010 1015 1020 

25 gag tgc gac tea cga ccc tgc ctg eta ggc ggc acc tgt cag gac 
ggt 3120 

Glu Cys Asp Ser Arg Pro Cys Leu Leu Gly Gly Thr Cys Gin Asp 
Gly 

1025 1030 1035 

30 1040 

cgc ggt etc cac agg tgc acc tgc ccc cag ggc tac act ggc ccc 
aac 3168 

Arg Gly Leu His Arg Cys Thr Cys Pro Gin Gly Tyr Thr Gly Pro 
35 Asn 

1045 1050 1055 

tgc cag aac ctt gtg cac tgg tgt gac tec teg ccc tgc aag aac 
ggc 3216 

40 Cys Gin Asn Leu Val His Trp Cys Asp Ser Ser Pro Cys Lys Asn 
Gly 

1060 1065 1070 

ggc aaa tgc tgg cag acc cac acc cag tac cgc tgc gag tgc ccc 
45 age 3264 

Gly Lys Cys Trp Gin Thr His Thr Gin Tyr Arg Cys Glu Cys Pro 
Ser 

1075 1080 1085 

50 ggc tgg acc ggc ctt tac tgc gac gtg ccc age gtg tec tgt gag 
gtg 3312 
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Gly Trp Thr Gly Leu Tyr Cys Asp Val Pro Ser Val Ser Cys Glu 
Val 

1090 1095 1100 

5 get gcg cag cga caa ggt gtt gac gtt gec cgc ctg tgc cag cat 
gga 3360 

Ala Ala Gin Arg Gin Gly Val Asp Val Ala Arg Leu Cys Gin His 
Gly 

1105 1110 1115 

10 1120 

ggg etc tgt gtg gac gcg ggc aac acg cac cac tgc cgc tgc cag 
gcg 3408 

Gly Leu Cys Val Asp Ala Gly Asn Thr His His Cys Arg Cys Gin 
15 Ala 

1125 1130 1135 

ggc tac aca ggc age tac tgt gag gac ctg gtg gac gag tgc tea 
ccc 3456 

20 Gly Tyr Thr Gly Ser Tyr Cys Glu Asp Leu Val Asp Glu Cys Ser 
Pro 

1140 1145 1150 

age ccc tgc cag aac ggg gec acc tgc acg gac tac ctg ggc ggc 
25 tac 3504 

Ser Pro Cys Gin Asn Gly Ala Thr Cys Thr Asp Tyr Leu Gly Gly 
Tyr 

1155 1160 1165 

30 tec tgc aag tgc gtg gec ggc tac cac ggg gtg aac tgc tct gag 
gag 3552 

Ser Cys Lys Cys Val Ala Gly Tyr His Gly Val Asn Cys Ser Glu 
Glu 

1170 1175 1180 

35 

ate gac gag tgc etc tec cac ccc tgc cag aac ggg ggc acc tgc 
etc 3600 

lie Asp Glu Cys Leu Ser His Pro Cys Gin Asn Gly Gly Thr Cys 
Leu 

40 1185 1190 1195 

1200 

gac etc ccc aac acc tac aag tgc tec tgc cca egg ggc act cag 
ggt 3648 

45 Asp Leu Pro Asn Thr Tyr Lys Cys Ser Cys Pro Arg Gly Thr Gin 
Gly 

1205 1210 1215 

gtg cac tgt gag ate aac gtg gac gac tgc aat ccc ccc gtt gac 
50 ccc 3696 

Val His Cys Glu lie Asn Val Asp Asp Cys Asn Pro Pro Val Asp 
Pro 

10 
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1220 1225 1230 

gtg tec egg age ccc aag tgc ttt aac aac ggc acc tgc gtg gac 
cag 3744 

5 Val Ser Arg Ser Pro Lys Cys Phe Asn Asn Gly Thr Cys Val Asp 
Gin 

1235 1240 * 1245 

gtg ggc ggc tac age tgc acc tgc ccg ccg ggc ttc gtg ggt gag 
10 cgc 3792 

Val Gly Gly Tyr Ser Cys Thr Cys Pro Pro Gly Phe Val Gly Glu 
Arg 

1250 1255 1260 

15 tgt gag ggg gat gtc aac gag tgc ctg tec aat ccc tgc gac gec 
cgt 3 840 

Cys Glu Gly Asp Val Asn Glu Cys Leu Ser Asn Pro Cys Asp Ala 
Arg 

1265 1270 1275 

20 1280 

ggc acc cag aac tgc gtg cag cgc gtc aat gac ttc cac tgc gag 
tgc 3888 

Gly Thr Gin Asn Cys Val Gin Arg Val Asn Asp Phe His Cys Glu 
25 Cys 

1285 1290 1295 

cgt get ggt cac acc ggg cgc cgc tgc gag tec gtc ate aat ggc 
tgc 3936 

30 Arg Ala Gly His Thr Gly Arg Arg Cys Glu Ser Val lie Asn Gly 
Cys 

1300 1305 1310 

aaa ggc aag ccc tgc aag aat ggg ggc acc tgc gee gtg gec tec 
35 aac 3984 

Lys Gly Lys Pro Cys Lys Asn Gly Gly Thr Cys Ala Val Ala Ser 
Asn 

1315 1320 1325 

40 acc gec cgc ggg ttc ate tgc aag tgc cct gcg ggc ttc gag ggc 
gee 4032 

Thr Ala Arg Gly Phe lie Cys Lys Cys Pro Ala Gly Phe Glu Gly 
Ala 

1330 1335 1340 

45 

acg tgt gag aat gac get cgt acc tgc ggc age ctg cgc tgc etc 
aac 4 080 

Thr Cys Glu Asn Asp Ala Arg Thr Cys Gly Ser Leu Arg Cys Leu 
Asn 

50 1345 1350 1355 

1360 
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ggc ggc aca tgc ate tec ggc ccg cgc age ccc ace tgc ctg tgc 
ctg 4128 

Gly Gly Thr Cys lie Ser Gly Pro Arg Ser Pro Thr Cys Leu Cys 
Leu 

5 1365 1370 1375 

ggc ccc ttc acg ggc ccc gaa tgc cag ttc ccg gec age age ccc 
tgc 4176 

Gly Pro Phe Thr Gly Pro Glu Cys Gin Phe Pro Ala Ser Ser Pro 
10 Cys 

1380 1385 1390 

ctg ggc ggc aac ccc tgc tac aac cag ggg ace tgt gag ccc aca 
tec 4224 

15 Leu Gly Gly Asn Pro Cys Tyr Asn Gin Gly Thr Cys Glu Pro Thr 
Ser 

1395 1400 1405 

gag age ccc ttc tac cgt tgc ctg tgc ccc gec aaa ttc aac ggg 
20 etc 4272 

Glu Ser Pro Phe Tyr Arg Cys Leu Cys Pro Ala Lys Phe Asn Gly 
Leu 

1410 1415 1420 

25 ttg tgc cac ate ctg gac tac age ttc ggg ggt ggg gee ggg cgc 
gac 4320 

Leu Cys His lie Leu Asp Tyr Ser Phe Gly Gly Gly Ala Gly Arg 
Asp 

1425 1430 1435 

30 1440 

ate ccc ccg ccg ctg ate gag gag gcg tgc gag ctg ccc gag tgc 
cag 4368 

lie Pro Pro Pro Leu lie Glu Glu Ala Cys Glu Leu Pro Glu Cys 
35 Gin 

1445 1450 1455 

gag gac gcg ggc aac aag gtc tgc age ctg cag tgc aac aac cac 
gcg 4416 

40 Glu Asp Ala Gly Asn Lys Val Cys Ser Leu Gin Cys Asn Asn His 
Ala 

1460 1465 1470 

tgc ggc tgg gac ggc ggt gac tgc tec etc aac ttc aat gac ccc 
45 tgg 4464 

Cys Gly Trp Asp Gly Gly Asp Cys Ser Leu Asn Phe Asn Asp Pro 
Trp 

1475 1480 1485 

50 aag aac tgc acg cag tct ctg cag tgc tgg aag tac ttc agt gac 
ggc 4512 

12 
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Lys Asn Cys Thr Gin Ser Leu Gin Cys Trp Lys Tyr Phe Ser Asp 
Gly 

1490 1495 1500 

5 cac tgt gac age cag tgc aac tea gec ggc tgc etc ttc gac ggc 
ttt 4560 

His Cys Asp Ser Gin Cys Asn Ser Ala Gly Cys Leu Phe Asp Gly 
Phe 

1505 1510 1515 

10 1520 

gac tgc cag cgt gcg gaa ggc cag tgc aac ccc ctg tac gac cag 
tac 4608 

Asp Cys Gin Arg Ala Glu Gly Gin Cys Asn Pro Leu Tyr Asp Gin 
15 Tyr 

1525 1530 1535 

tgc aag gac cac ttc age gac ggg cac tgc gac cag ggc tgc aac 
age 4656 

20 Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gin Gly Cys Asn 
Ser 

1540 1545 1550 

gcg gag tgc gag tgg gac ggg ctg gac tgt gcg gag cat gta ccc 
25 gag 4704 

Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala Glu His Val Pro 
Glu 

1555 1560 1565 

30 agg ctg gcg gec ggc acg ctg gtg gtg gtg gtg ctg atg ccg ccg 
gag 4752 

Arg Leu Ala Ala Gly Thr Leu Val Val Val Val Leu Met Pro Pro 
Glu 

1570 1575 1580 

35 

cag ctg cgc aac age tec ttc cac ttc ctg egg gag etc age cgc 
gtg 4800 

Gin Leu Arg Asn Ser Ser Phe His Phe Leu Arg Glu Leu Ser Arg 
Val 

40 1585 1590 1595 

1600 

ctg cac ace aac gtg gtc ttc aag cgt gac gca cac ggc cag cag 
atg 4848 

45 Leu His Thr Asn Val Val Phe Lys Arg Asp Ala His Gly Gin Gin 
Met 

1605 1610 1615 

ate ttc ccc tac tac ggc cgc gag gag gag ctg cgc aag cac ccc 
50 ate 4896 

lie Phe Pro Tyr Tyr Gly Arg Glu Glu Glu Leu Arg Lys His Pro 
He 

13 
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1620 



1625 
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1630 



aag cgt gcc gcc gag ggc tgg gcc gca cct gac gcc ctg ctg ggc 
cag 4944 

5 Lys Arg Ala Ala Glu Gly Trp Ala Ala Pro Asp Ala Leu Leu Gly 
Gin 

1635 1640 1645 

gtg aag gcc teg ctg etc cct ggt ggc age gag ggt ggg egg egg 
10 egg 4992 

Val Lys Ala Ser Leu Leu Pro Gly Gly Ser Glu Gly Gly Arg Arg 
Arg 

1650 1655 1660 

15 agg gag ctg gac ccc atg gac gtc cgc ggc tec ate gtc tac ctg 
gag 5040 

Arg Glu Leu Asp Pro Met Asp Val Arg Gly Ser lie Val Tyr Leu 
Glu 

1665 1670 1675 

20 1680 

att gac aac egg cag tgt gtg cag gcc tec teg cag tgc ttc cag 
agt 5088 

He Asp Asn Arg Gin Cys Val Gin Ala Ser Ser Gin Cys Phe Gin 
25 Ser 

1685 1690 1695 

gcc ace gat gtg gcc gca ttc ctg gga gcg etc gcc teg ctg ggc 
age 5136 

30 Ala Thr Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser Leu Gly 
Ser 

1700 1705 1710 

etc aac ate ccc tac aag ate gag gcc gtg cag agt gag acc gtg 
35 gag 5184 

Leu Asn He Pro Tyr Lys He Glu Ala Val Gin Ser Glu Thr Val 
Glu 

1715 1720 1725 

40 ccg ccc ccg ccg gcg cag ctg cac ttc atg tac gtg gcg gcg gcc 
gcc 5232 

Pro Pro Pro Pro Ala Gin Leu His Phe Met Tyr Val Ala Ala Ala 
Ala 

1730 1735 1740 

45 

ttt gtg ctt ctg ttc ttc gtg ggc tgc ggg gtg ctg ctg tec cgc 
aag 5280 

Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser Arg 
Lys 

50 1745 1750 1755 

1760 
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cgc egg egg cag cat ggc cag etc tgg ttc cct gag ggc ttc aaa 
gtg 5328 

Arg Arg Arg Gin His Gly Gin Leu Trp Phe Pro Glu Gly Phe Lys 
Val 

5 1765 1770 1775 

tct gag gec age aag aag aag egg egg gag ccc etc ggc gag gac 
tec 5376 

Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Pro Leu Gly Glu Asp 
10 Ser 

1780 1785 1790 

gtg ggc etc aag ccc ctg aag aac get tea gac ggt gee etc atg 
gac 5424 

15 Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp Gly Ala Leu Met 
Asp 

1795 1800 1805 

gac aac cag aat gag tgg ggg gac gag gac ctg gag acc aag aag 
20 ttc 5472 

Asp Asn Gin Asn Glu Trp Gly Asp Glu Asp Leu Glu Thr Lys Lys 
Phe 

1810 1815 1820 

25 egg ttc gag gag ccc gtg gtt ctg cct gac ctg gac gac cag aca 
gac 5520 

Arg Phe Glu Glu Pro Val Val Leu Pro Asp Leu Asp Asp Gin Thr 
Asp 

1825 1830 1835 

30 1840 

cac egg cag tgg act cag cag cac ctg gat gee get gac ctg cgc 
atg 5568 

His Arg Gin Trp Thr Gin Gin His Leu Asp Ala Ala Asp Leu Arg 
35 Met 

1845 1850 1855 

tct gee atg gee ccc aca ccg ccc cag ggt gag gtt gac gee gac 
tgc 5616 

40 Ser Ala Met Ala Pro Thr Pro Pro Gin Gly Glu Val Asp Ala Asp 
Cys 

1860 1865 1870 

atg gac gtc aat gtc cgc ggg cct gat ggc ttc acc ccg etc atg 
45 ate ~ 5664 

Met Asp Val Asn Val Arg Gly Pro Asp Gly Phe Thr Pro Leu Met 
He 

1875 1880 1885 

50 gee tec tgc age ggg ggc ggc ctg gag acg ggc aac age gag gaa 
gag 5712 
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Ala Ser Cys Ser Gly Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu 
Glu 

1890 1895 1900 

5 gag gac gcg ccg gcc gtc ate tec gac ttc ate tac cag ggc gee 
age 5760 

Glu Asp Ala Pro Ala Val lie Ser Asp Phe lie Tyr Gin Gly Ala 
Ser 

1905 1910 1915 

10 1920 

ctg cac aac cag aca gac cgc acg ggc gag acc gcc ttg cac ctg 
gcc 5808 

Leu His Asn Gin Thr Asp Arg Thr Gly Glu Thr Ala Leu His Leu 
15 Ala 

1925 1930 1935 

gcc cgc tac tea cgc tct gat gcc gcc aag cgc ctg ctg gag gcc 
age 5856 

20 Ala Arg Tyr Ser Arg Ser Asp Ala Ala Lys Arg Leu Leu Glu Ala 
Ser 

1940 1945 1950 

gca gat gcc aac ate cag gac aac atg ggc cgc acc ccg ctg cat; 
25 gcg 5904 

Ala Asp Ala Asn He Gin Asp Asn Met Gly Arg Thr Pro Leu His 
Ala 

1955 1960 1965 

30 get gtg tct gcc gac gca caa ggt gtc ttc cag ate ctg ate egg 
aac 5952 

Ala Val Ser Ala Asp Ala Gin Gly Val Phe Gin He Leu He Arg 
Asn 

1970 1975 1980 

35 

cga. gcc aca gac ctg gat gcc cgc atg cat gat ggc acg acg cca 
ctg 6000 

Arg Ala Thr Asp Leu Asp Ala Arg Met His Asp Gly Thr Thr Pro 
Leu 

40 1985 1990 1995 

2000 

ate ctg get gcc cgc ctg gcc gtg gag ggc atg ctg gag gac etc 
ate 6048 

45 He Leu Ala Ala Arg Leu Ala Val Glu Gly Met Leu Glu Asp Leu 
He 

2005 2010 2015 

aac tea cac gcc gac gtc aac gcc gta gat gac ctg ggc aag tec 
50 gcc 6096 

Asn Ser His Ala Asp Val Asn Ala Val Asp Asp Leu Gly Lys Ser 
Ala 

16 
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2020 
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2030 



ctg cac tgg gcc gcc gcc gtg aac aat gtg gat gcc gca gtt gtg 
etc 6144 

5 Leu His Trp Ala Ala Ala Val Asn Asn Val Asp Ala Ala Val Val 
Leu 

2035 2040 2045 

ctg aag aac ggg get aac aaa gat atg cag aac aac agg gag gag 
10 aca 6192 

Leu Lys Asn Gly Ala Asn Lys Asp Met Gin Asn Asn Arg Glu Glu 
Thr 

2050 2055 2060 

15 ccc ctg ttt ctg gcc gcc egg gag ggc age tac gag acc gcc aag 
gtg 6240 

Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys 
Val 

2065 2070 2075 

20 2080 

ctg ctg gac cac ttt gcc aac egg gac ate acg gat cat atg gac 
cgc 6288 

Leu Leu Asp His Phe Ala Asn Arg Asp lie Thr Asp His Met Asp 
25 Arg 

2085 2090 2095 

ctg ccg cgc gac ate gca cag gag cgc atg cat cac gac ate gtg 
agg 6336 

30 Leu Pro Arg Asp He Ala Gin Glu Arg Met His His Asp He Val 
Arg 

, 2100 • 2105 2110 

ctg ctg gac gag tac aac ctg gtg cgc age ccg cag ctg cac gga 
35 gcc 6384 

Leu Leu Asp Glu Tyr Asn Leu Val Arg Ser Pro Gin Leu His Gly 
Ala 

2115 2120 2125 

40 ccg ctg ggg ggc acg ccc acc ctg teg ccc ccg etc tgc teg ccc 
aac 6432 

Pro Leu Gly Gly Thr Pro Thr Leu Ser Pro Pro Leu Cys Ser Pro 
Asn 

2130 2135 2140 

45 

ggc tac ctg ggc age etc aag ccc ggc gtg cag ggc aag aag gtc 
cgc 6480 

Gly Tyr Leu Gly Ser Leu Lys Pro Gly Val Gin Gly Lys Lys Val 
Arg 

50 2145 2150 2155 

2160 
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aag ccc age age aaa ggc ctg gec tgt gga age aag gag gec aag 
gac 6528 

Lys Pro Ser Ser Lys Gly Leu Ala Cys Gly Ser Lys . Glu Ala Lys 
Asp 

5 2165 2170 2175 

etc aag gca egg agg aag aag tec cag gat ggc aag ggc tgc ctg 
ctg 6576 

Leu Lys Ala Arg Arg Lys Lys Ser Gin Asp Gly Lys Gly Cys Leu 
10 Leu 

2180 2185 2190 

gac age tec ggc atg etc teg ccc gtg gac tec ctg gag tea ccc 
cat 6624 

15 Asp Ser Ser Gly Met Leu Ser Pro Val Asp Ser Leu Glu Ser Pro 
His 

2195 2200 2205 

ggc tac ctg tea gac gtg gee teg ccg cca ctg ctg ccc tec ccg 
20 ttc 6672 

Gly Tyr Leu Ser Asp Val Ala Ser Pro Pro Leu Leu Pro Ser Pro 
Phe 

2210 2215 2220 

25 cag cag tct ccg tec gtg ccc etc aac cac ctg cct ggg atg ccc 
gac 6720 

Gin Gin Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met Pro 
Asp 

2225 2230 2235 

30 2240 

ace cac ctg ggc ate ggg cac ctg aac gtg gcg gee aag ccc gag 
atg 6768 

Thr His Leu Gly lie Gly His Leu Asn Val Ala Ala Lys Pro Glu 
35 Met 

2245 2250 2255 

gcg gcg ctg ggt ggg ggc ggc egg ctg gee ttt gag act ggc cca 
cct 6816 

40 Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu Thr Gly Pro 
Pro 

2260 2265 2270 

cgt etc tec cac ctg cct gtg gee tct ggc ace age acc gtc ctg 
45 ggc 6864 

Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr Ser Thr Val Leu 
Gly 

2275 2280 2285 

50 tec age age gga ggg gee ctg aat ttc act gtg ggc ggg tec acc 
agt 6912 
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Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr Val Gly Gly Ser Thr 
Ser 

2290 2295 2300 

5 ttg aat-ggt caa tgc gag tgg ctg tec egg ctg cag age ggc atg 
gtg 6960 

Leu Asn Gly Gin Cys Glu Trp Leu Ser Arg Leu Gin Ser Gly Met 
Val 

2305 2310 2315 

10 2320 

ccg aac caa tac aac cct ctg egg ggg agt gtg gca cca ggc ccc 
ctg 7008 

Pro Asn Gin Tyr Asn Pro Leu Arg Gly Ser Val Ala Pro Gly Pro 
15 Leu 

2325 2330 2335 

age aca cag gec ccc tec ctg cag cat ggc atg gta ggc ccg ctg 
cac 7056 

20 Ser Thr Gin Ala Pro Ser Leu Gin His Gly Met Val Gly Pro Leu 
His 

2340 2345 2350 

agt age ctt get gec age gec ctg tec cag atg atg age tac cag 
25 ggc 7104 

Ser Ser Leu Ala Ala Ser Ala Leu Ser Gin Met Met Ser Tyr Gin 
Gly 

2355 2360 2365 

30 ctg ccc age acc egg ctg gee acc cag cct cac ctg gtg cag ace 
cag 7152 

Leu Pro Ser Thr Arg Leu Ala Thr Gin Pro His Leu Val Gin Thr 
Gin 

2370 2375 2380 

35 

cag gtg cag cca caa aac tta cag atg cag cag cag aac ctg cag 
cca 7200 

Gin Val Gin Pro Gin Asn Leu Gin Met Gin Gin Gin Asn Leu Gin 
Pro 

40 2385 2390 2395 

2400 

gca aac ate cag cag cag caa age ctg cag ccg cca cca cca cca 
cca 7248 

45 Ala Asn lie Gin Gin Gin Gin Ser Leu Gin Pro Pro Pro Pro Pro 
Pro 

2405 2410 2415 

cag ccg cac ctt ggc gtg age tea gca gee age ggc cac ctg ggc 
50 egg 7296 

Gin Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu Gly 
Arg 

19 
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2420 2425 2430 

age ttc ctg agt gga gag ccg age cag gca gac gtg cag cca ctg 
ggc 7344 

5 Ser Phe Leu Ser Gly Glu Pro Ser Gin Ala Asp Val Gin Pro Leu 
Gly 

2435 2440 2445 

ccc age age ctg gcg gtg cac act att ctg ccc cag gag age ccc 
10 gec 7392 

Pro Ser Ser Leu Ala Val His Thr lie Leu Pro Gin Glu Ser Pro 
Ala 

2450 2455 2460 

15 ctg ccc acg teg ctg cca tec teg ctg gtc cca ccc gtg acc gca 
gec 7440 

Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr Ala 
Ala 

2465 2470 2475 

20 2480 

cag ttc ctg acg ccc ccc teg cag cac age tac tec teg cct gtg 
gac 7488 

Gin Phe Leu Thr Pro Pro Ser Gin His Ser Tyr Ser Ser Pro Val 
25 Asp 

2485 2490 2495 

aac acc ccc age cac cag eta cag gtg cct gag cac ccc ttc etc 
acc 7536 

30 Asn Thr Pro Ser His Gin Leu Gin Val Pro Glu His Pro Phe Leu 
Thr 

2500 2505 2510 

cct teg ccg gag teg ccc gac caa tgg teg tec teg teg ccg cac 
35 tct 7584 

Pro Ser Pro Glu Ser Pro Asp Gin Trp Ser Ser Ser Ser Pro His 
Ser 

2515 2520 2525 

40 aat gtg tct gac tgg tct gag ggc gtg teg teg ccc ccg acc tec 
atg 7632 

Asn Val Ser Asp Trp Ser Glu Gly Val Ser Ser Pro Pro Thr Ser 
Met 

2530 2535 2540 



45 



cag tec cag ate gcg cgc ate ccg gag gcg ttc aag taa ta 
7673 

Gin Ser Gin lie Ala Arg lie Pro Glu Ala Phe Lys 
2545 2550 2555 



50 
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<210> 2 
<211> 3228 
5 <212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
10 <222> (716) . . (2689) 

<400> 2 

tcctccactc tgggaacagc tcacaccccc aaagtggtga ccaccatggc 
cactatgccc 60 

15 

acagccactg cctccacggt tcccagctcg tccaccgtgg ggaccacccg 
cacccctgca 120 

gtgctcccca gcagcctgcc aaccttcagc gtgtccactg tgtcctcctc 
20 agtcctcacc 180 

accctgagac ccactggctt ccccagctcc cacttttcta ctccctgctt 
ctgcagggca 240 

25 tttggacagt ttttctcgcc cggggaagtc atctacaata agaccgaccg 
agccggctgc 3 00 

catttctacg cagtgtgcaa tcagcactgt gacattgacc gcttccaggg 

cgcctgtccc 360 

30 

acctccccac cgccagtgtc ctccgccccg ctgtcctcgc cctcccctgc 

ccctggctgt 420 

gacaatgcca tccctctccg gcaggtgaat gagacctgga ccctggagaa 
35 ctgcacggtg 480 

gccaggtgcg tgggtgacaa ccgtgtcgtc ctgctggacc caaagcctgt 
ggccaacgtc 540 

40 acctgcgtga acaagcacct gcccatcaaa gtgtcggacc cgagccagcc 
ctgtgacttc 600 

cactatgagt gcgagtgcat ctgcagcatg tggggcggct cccactattc 
cacctttgac 660 

45 

ggcacctctt acaccttccg gggcaactgc acctatgtcc tcatgagaga gatcc 
atg 718 

Met 

50 

1 
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cac get ttg gga ate tea gec tct acc tgg aca acc act act gca 
egg 766 

His Ala Leu Gly He Ser Ala Ser Thr Trp Thr Thr Thr Thr Ala 
Arg 

5 5 10 15 

cct ctg cca ctg ccg ctg ccg ccc get gee ccc gcg gee etc age 
ate 814 

Pro Leu Pro Leu Pro Leu Pro Pro Ala Ala Pro Ala Ala Leu Ser 
10 He 

20 25 30 

cac tac aag tec atg gat ate gtc etc act gtc acc atg gtg cat 
ggg 862 

15 His Tyr Lys Ser Met Asp He Val Leu Thr Val Thr Met Val His 
Gly 

35 40 45 

aag gag gag ggc ctg ate ctg ttt gac cca att ccg gtg age age 
20 ggt 910 

Lys Glu Glu Gly Leu He Leu Phe Asp Pro He Pro Val Ser Ser 
Gly 

50 55 60 

65 

25 

ttc age aag aac ggc gtg ctt gtg tct gtg ctg ggg acc acc acc 
atg 958 

Phe Ser Lys Asn Gly Val Leu Val Ser Val Leu Gly Thr Thr Thr 
I^t 

30 70 75 80 

get gtg gac att cct gee ctg ggc gtg age gtc acc ttc aat ggc 
caa 1006 

Ala Val Asp He Pro Ala Leu Gly Val Ser Val Thr Phe Asn Gly 
35 Gin 

85 90 95 

gtg ttc cag gee egg ctg ccc tac age etc ttc cac aac aac acc 
gag 1054 

40 Val Phe Gin Ala Arg Leu Pro Tyr Ser Leu Phe His Asn Asn Thr 
Glu 

100 105 110 

ggc cag tgc ggc acc tgc acc aac aac cag agg gac gac tgt etc 
45 cag 1102 

Gly Gin Cys Gly Thr Cys Thr Asn Asn Gin Arg Asp Asp Cys Leu 
Gin 

115 120 125 

50 egg gac gga acc act gee gee agt tgc aag gac atg gee aag acg 
tgg 1150 

22 
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Arg Asp Gly Thr 
Trp 

130 
145 

5 

ctg gtc ccc gac 
aca 1198 

Leu Val Pro Asp 
Thr 

10 

ccc ccc act gcc 
ccc 1246 
Pro Pro Thr Ala 
15 Pro 

165 

acc cca tgc cca 
gtc 1294 
20 Thr Pro Cys Pro 
Val 

180 

ttt get gag tgc 
25 gcc 1342 

Phe Ala Glu Cys 
Ala 

195 

30 tgc ate age gac 
age 1390 

Cys lie Ser Asp 
Ser 
210 
35 225 

ctg gag get tac 
gac 1438 
Leu Glu Ala Tyr 
40 Asp 



tgg cga ggt gca 
acc 1486 
45 Trp Arg Gly Ala 
Thr 

245 

aaa gtg tac aag 
50 tct 1534 

Lys Val Tyr Lys 
Ser 



Thr Ala Ala Ser Cys Lys 
135 

age aga aag gat ggc tgt 
Ser Arg Lys Asp Gly Cys 
150 155 
age ccc gca gcc ccg gtg 
Ser Pro Ala Ala Pro Val 
170 

cca cag ctg etc tgt gat 
Pro Gin Leu Leu Cys Asp 
185 

cac acc ctt ctg ccc ccg 
His Thr Leu Leu Pro Pro 
200 

cac tgc agg ggc cgc ctt 
His Cys Arg Gly Arg Leu 
215 

gca gag etc tgc cgc gcc 
Ala Glu Leu Cys Arg Ala 
230 235 
acc ggt ggc ctg tgc gac 
Thr Gly Gly Leu Cys Asp 
250 

cca tgc ggc ccc ata cag 
Pro Cys Gly Pro lie Gin 

23 
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Asp Met Ala Lys Thr 

140 

tgg gcc ccg act ggc 
Trp Ala Pro Thr Gly 
160 

tct age aca ccc acc 
Ser Ser Thr Pro Thr 
175 

ctg atg ctg age cag 
Leu Met Leu Ser Gin 
190 

ggc cca ttc ttc aac 
Gly Pro Phe Phe Asn 
205 

gag gtg ccc tgc cag 
Glu Val Pro Cys Gin 
220 

egg gga gtg tgc agt 
Arg Gly Val Cys Ser 
240 

etc acc tgc cca ccc 
Leu Thr Cys Pro Pro 
255 

cct gcc acc tgc aac 
Pro Ala Thr Cys Asn 
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260 265 270 

agg aac cag age cca cag ctg gag ggg atg gcg gag ggc tgc ttc 
tgc 1582 

5 Arg Asn Gin Ser Pro Gin Leu Glu Gly Met Ala Glu Gly Cys Phe 
Cys 

275 280 285 

cct gag gac cag ate etc ttc aac gca cac atg ggc ate tgc gtg 
10 cag 1630 

Pro Glu Asp Gin He Leu Phe Asn Ala His Met Gly He Cys Val 
Gin 

290 295 300 

305 

15 

gec tgc ccc tgc gtg gga ccc gat ggg ttt cct aaa ttt ccc ggg 
gag 1678 

Ala Cys Pro Cys Val Gly Pro Asp Gly Phe Pro Lys Phe Pro Gly 
Glu 

20 310 315 320 

egg tgg gtc age aac tgc cag tec tgc gtg tgt gac gag ggt tea 
gtg 1726 

Arg Trp Val Ser Asn Cys Gin Ser Cys Val Cys Asp Glu Gly Ser 
25 Val 

325 330 335 

teg gtg cag tgc aag ccc ctg ccc tgt gac gee cag ggt cag ccc 
ccg 1774 

30 Ser Val Gin Cys Lys Pro Leu Pro Cys Asp Ala Gin Gly Gin Pro 
Pro 

340 345 350 

ccg tgc aac cgt ccc ggc ttc gta ace gtg ace agg ccc egg gec 
35 gag 1822 

Pro Cys Asn Arg Pro Gly Phe Val Thr Val Thr Arg Pro Arg Ala 
Glu 

355 360 365 

40 aac ccc tgc tgc ccc gag acg gtg tgc gtg tgc aac aca acc ace 
tgc 1870 

Asn Pro Cys Cys Pro Glu Thr Val Cys Val Cys Asn Thr Thr Thr 
Cys 

370 375 380 

45 385 

ccc cag age ctg cct gtg tgc ccg cca ggg cag gag tec ate tgc 
acc 1918 

Pro Gin Ser Leu Pro Val Cys Pro Pro Gly Gin Glu Ser He Cys 
50 Thr 

390 395 400 
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cag gag gag ggc gac tgc tgt ccc acc ttc cgc tgc aga cct cag 
ctg 1966 

Gin Glu Glu Gly Asp Cys Cys Pro Thr Phe Arg Cys Arg Pro Gin 
Leu 

5 405 410 415 

tgt teg tac aat ggc acc ttc tac ggg gtt ggt gca acc ttc cca 
ggc 2014 

Cys Ser Tyr Asn Gly Thr Phe Tyr Gly Val Gly Ala Thr Phe Pro 
10 Gly 

420 425 430 

gec ctt ccc tgc cac atg tgt acc tgc etc tct ggg gac acc cag 
gac 2062 

15 Ala Leu Pro Cys His Met Cys Thr Cys Leu Ser Gly Asp Thr Gin 
Asp 

435 440 445 

cca acg gtg caa tgt cag gag gat gee tgc aac aat act acc tgt 
20 ccc 2110 

Pro Thr Val Gin Cys Gin Glu Asp Ala Cys Asn Asn Thr Thr Cys 
Pro 

450 455 460 

465 

25 

cag ggc ttt gag tac aag aga gtg gee ggg cag tgc tgt ggg gag 
tgc 2158 

Gin Gly Phe Glu Tyr Lys Arg Val Ala Gly Gin Cys Cys Gly Glu 
Cys 

30 470 475 480 

gtc cag acc gec tgc etc acg ccc gat ggc cag cca gtc cag ctg 
aat 2206 

Val Gin Thr Ala Cys Leu Thr Pro Asp Gly Gin Pro Val Gin Leu 
35 Asn 

485 490 495 

gaa acc tgg gtc aac age cat gtg gac aac tgc acc gtg tac etc 
tgt 2254 

40 Glu Thr Trp Val Asn Ser His Val Asp Asn Cys Thr Val Tyr Leu 
Cys 

500 505 510 

gag get gag ggt gga gtc cat ttg ctg acc cca cag cct gca tec 
45 tgc 2302 

Glu Ala Glu Gly Gly Val His Leu Leu Thr Pro Gin Pro Ala Ser 
Cys 

515 520 525 

50 cca gat gtg tec age tgc agg ggg age etc agg aaa acc ggc tgc 
tgc 2350 

25 
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Pro Asp Val Ser Ser Cys Arg Gly Ser Leu Arg Lys Thr Gly Cys 
Cys 

530 535 540 

545 

5 

tac tec tgt gag gag gac tec tgt caa gtc cgc ate aac acg acc 
ate 2398 

Tyr Ser Cys Glu Glu Asp Ser Cys Gin Val Arg lie Asn Thr Thr 
He 

10 550 555 560 

ctg tgg cac cag ggc tgc gag acc gag gtc aac ate acc ttc tgc 
gag 2446 

Leu Trp His Gin Gly Cys Glu Thr Glu Val Asn He Thr Phe Cys 
15 Glu 

565 570 575 

ggc tec tgc ccc gga gcg tec aag tac tea gca gag gee cag gec 
atg 2494 

20 Gly Ser Cys Pro Gly Ala Ser Lys Tyr Ser Ala Glu Ala Gin Ala 
Met 

580 585 590 

cag cac cag tgc acc tgc tgc cag gag agg egg gtc cac gag gag 
25 acg 2542 

Gin His Gin Cys Thr Cys Cys Gin Glu Arg Arg Val His Glu Glu 
Thr 

595 600 605 

30 gtg ccc ttg cac tgt cct aac ggc tea gee ate ctg cac acc tac 
acc 2590 

Val Pro Leu His Cys Pro Asn Gly Ser Ala He Leu His Thr Tyr 
Thr 

610 615 620 

35 625 

cac gtg gat gag tgt ggc tgc acg ccc ttc tgt gtc cct gcg ccc 
atg 2638 

His Val Asp Glu Cys Gly Cys Thr Pro Phe Cys Val Pro Ala Pro 
40 Met 

630 635 640 

get ccc cca cac acc cgt ggc ttc ccg gec cag gag gee act get 
gtc 2686 

45 Ala Pro Pro His Thr Arg Gly Phe Pro Ala Gin Glu Ala Thr Ala 
Val 

645 650 655 

tga gaac gttctgcctc catccccatg ctctgtccac ctggagccag 
50 gatgtgcatt 2743 
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gtctgatcat gaaaaccttg ggcctcctct gcggagcccc ccggcctgtg 
tgtggcaccc 2803 

5 cgcgctccgt gctcctgctg cccaccccgt gggtgaaacc ggccccagaa 
gggtgagggg 2863 

ccagcaggac cctttcggga gggcgccact caggagtcct accctgggag 
agcctgtggc 2923 

10 

ccaccttggc cttgcccctc cctgatgtca ctgggacgcc ctggaacaaa 
ctaagcatgt 2983 

gcgggcctat gtgtccctgc cacggccgga cgcccgcgca gcacggattc 
15 cagctggcca 3043 

cgtccggccg ctggggcaga caggctggtc caggcaaggc cagctgctgc 
caggaagctg 3103 

20 cgacaggcaa gcggccgcct gtccatgcct gctgcagggt aactcagggc 
tgaggtcgca 3163 

acggccaggt cagagagggg tcagcatccc aaagccccct ctgctcaacc 
cagcccagtt 3223 

25 

ttgca 
3228 



30 

<210> 3 
<211> 1375 
<212> DNA 
35 <213> Homo sapiens 

<220> 
<221> CDS 

<222> (490) . . (1044) 

40 

<400> 3 

tgcttccgcg tctagccgga gaaacttgag ccggctgccc cgcccacggt 
gcccgaagcc 60 

45 ccaaaggctg gaattagggg ctagaagtct ggcacccacc gcctggccag 
gtgttcggga 120 

cgcgaccagg tgggcggtcg ccccgccccg ggagcgcggc ttaatagctg 
agagcccggg 180 

50 

ggccaggccg cggctgcggc ccaggcaacg ccctgagggt ggccacgctg 
ccaggtgttc 240 
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cactcccccg ggactatggg caagggcccg gggcggggag ggcggcaggt 
gctgacactg 300 

5 gagctgcgcc ggaggtcggg gaactcggcc tcctaagact gaggacactc 
gcctgctggg 36 0 



10 



ccggtcgagc tgtgcggtgc cctccgggac gcagggggcg ctgcagccac 
gctgggtcag 420 

gctccgcagg gccctcccaa cccggggact aacggcgccg gtgacgactt 
cgccgcgcgt 480 



tggtcagcc atg gcc acc get etc gcg eta cgt age ttg tac cga 
15 gcg 528 

Met Ala Thr Ala Leu Ala Leu Arg Ser Leu Tyr Arg 

Ala 

15 10 

20 cga ccc teg ctg cgc tgt ccg ccc gtt gag ctt ccc tgg gcc ccg 
egg 576 

Arg Pro Ser Leu Arg Cys Pro Pro Val Glu Leu Pro Trp Ala Pro 
Arg 

15 20 25 

25 

cga ggg cat egg etc teg ccg gcg gat gac gag ctg tat cag egg 
acg 624 

Arg Gly His Arg Leu Ser Pro Ala Asp Asp Glu Leu Tyr Gin Arg 
Thr 

30 30 35 40 

45 

cgc ate tct ctg ctg caa cgc gag gcc get cag gca atg tac ate 
gac 672 

35 Arg He Ser Leu Leu Gin Arg Glu Ala Ala Gin Ala Met Tyr He 
Asp 

50 55 60 

age tac aac age cgc ggc ttc atg ata aac gga aac cgc gtg etc 
40 ggc 720 

Ser Tyr Asn Ser Arg Gly Phe Met He Asn Gly Asn Arg Val Leu 
Gly 

65 70 75 

45 ccc tgc get ctg etc ccg cac teg gtg gtg cag tgg aac gtg gga 
tec 768 

Pro Cys Ala Leu Leu Pro His Ser Val Val Gin Trp Asn Val Gly 
Ser 

80 85 90 



50 



cac cag gac ate acc gaa gac age ttt tec etc ttc tgg ttg ctg 
gag 816 
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His Gin Asp He Thr Glu Asp Ser Phe Ser Leu Phe Trp Leu Leu 
Glu 

95 100 105 

5 ccc egg ata gag ate gtg gtg gtg ggg act gga gac egg acc gag 
agg 864 

Pro Arg He Glu He Val Val Val Gly Thr Gly Asp Arg Thr Glu 
Arg 

110 115 120 

10 125 

ctg cag tec cag gtg ctt caa gee atg agg cag egg ggc att get 
gtg 912 

Leu Gin Ser Gin Val Leu Gin Ala Met Arg Gin Arg Gly He Ala 
15 Val 

130 135 140 

gaa gtg cag gac acg ccc aat gee tgt gee acc ttc aac ttc ctg 
tgt 960 

20 Glu Val Gin Asp Thr Pro Asn Ala Cys Ala Thr Phe Asn Phe Leu 
Cys 

145 150 155 

cat gaa ggc cga gta act gga get get etc ate cct cca cca gga 
25 ggg 1008 

His Glu Gly Arg Val Thr Gly Ala Ala Leu He Pro Pro Pro Gly 
Gly 

160 165 170 

30 act tea ctt aca tct ttg ggc caa get get caa tga accg 
ccaggaactg 1058 
Thr Ser Leu Thr Ser Leu Gly Gin Ala Ala Gin 
175 180 

35 acctgctgac tgcactctgc caggcttccc aatgetttea ctcttatcta 
ccctttggca 1118 



40 



ettatcttge ttatcaacat aataatttat acacttctcc cattttgtat 
caggtgtgtt 1178 

getggecagg agctgatggc tcactgggct cttggagggg aatgtgaaga 
aaccaaggag 1238 



tcactttttc atctagatta cttaggattc cttgactttt cagaagtegg 
45 gaagcagtat 12 98 

gtttgcctgt tgtagaccta cttgctcaca tgcagatttg agaggacctc 
aacggctttt 1358 

50 ctcacaaaaa aaaaaaa 
1375 
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<210> 4 
5 <211> 569 

<212> DNA 
<213> Homo sapiens 

<220> 
10 <221> CDS 

<222> (194) . . (562) 

<400> 4 

aacacctgct ttttggggtg gttgtgagga ggcaaacagt gtccagcagg 
15 cctgccccag 60 

aaccccactt catccaaggg ggctgggagg tctctgatga acccaaccat 
gtgccttcac 120 

20 agccccgtca tgtgcgtctc caggtgtgag acaatgggat gtaatgacca 
ttgttctcga 180 

tgccagtaaa gcc atg cct ggg cac atg cca aga gga ggt tgg tgt 
ctg 229 

25 Met Pro Gly His Met Pro Arg Gly Gly Trp Cys 

Leu 

15 10 

ttt aca gga cct get tgg gtt ggc agt tec cat ggg gaa ggg aag 
30 agg 277 

Phe Thr Gly Pro Ala Trp Val Gly Ser Ser His Gly Glu Gly Lys 
Arg 

15 20 25 

35 cgt ggt ggg age ttg gcc cag ctt ggc aaa caa age cag tgg gaa 
cac 325 

Arg Gly Gly Ser Leu Ala Gin Leu Gly Lys Gin Ser Gin Trp Glu 
His 

30 35 40 

40 

aag cct gtc etc etc cat ggc cag ggg tgg aaa aaa gag tgt gta 
aag 3 73 

Lys Pro Val Leu Leu His Gly Gin Gly Trp Lys Lys Glu Cys Val 
Lys 

45 45 ' 50 55 

60 

cca age cca atg ccc etc ttt cca tct gtc ccc cag atg tec tac 
get 421 

50 Pro Ser Pro Met Pro Leu Phe Pro Ser Val Pro Gin Met Ser Tyr 
Ala 

65 70 75 

30 
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teg aca aca cct ata get ttg 
cag 4 69 

Ser Thr Thr Pro lie Ala Leu 
5 Gin 

80 

tgg agg tec tgc tec ctg acg 
age 517 
10 Trp Arg Ser Cys Ser Leu Thr 
Ser 

95 

tea ccc ctg tct agg tgg etc 
15 ccc 565 

Ser Pro Leu Ser Arg Trp Leu 
110 115 

tttg 
20 569 
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tec acg cca aga agg tea get tea 

Ser Thr Pro Arg Arg Ser Ala Ser 

85 90 

agg gca tgc aga aat atg ata agg 

Arg Ala Cys Arg Asn Met lie Arg 

100 105 

cct cat ttc tea gag acc tec taa 

Pro His Phe Ser Glu Thr Ser 
120 



25 <210> 5 

<211> 1881 



<212> DNA 

<213> Homo sapiens 

30 <220> 

<221> CDS 

<222> (1) . . (1584) 

<400> 5 

35 atg gee cag aca ctg cag atg gag ate ccg aac ttc ggc aac age 
ate 48 

Met Ala Gin Thr Leu Gin Met Glu lie Pro Asn Phe Gly Asn Ser 
He 

15 10 15 

40 

ctg gag tgc etc aat gaa cag egg ctg cag ggc ctg tac tgt gac 
gtg 96 

Leu Glu Cys Leu Asn Glu Gin Arg Leu Gin Gly Leu Tyr Cys Asp 
Val 

45 20 25 30 

tea gtg gtg gtc aag ggc cat gee ttc aag gee cac egg gee gtg 
ctt 144 

Ser Val Val Val Lys Gly His Ala Phe Lys Ala His Arg Ala Val 
50 Leu 

35 40 45 



31 
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get gec age age 
age 192 

Ala Ala Ser Ser 
Ser 
5 50 

gee gtg gtg gag 

cag 240 

Ala Val Val Glu 
10 Gin 
65 

80 

ate etc age ttc 

15 gac 288 

lie Leu Ser Phe 
Asp 

20 cag ttc ctg etc 

ate 336 

Gin Phe Leu Leu 
He 

100 

25 

atg gag aag ggc 

tgc 384 

Met Glu Lys Gly 
Cys 

30 115 

gac tec cag ggc 
cag 432 
Asp Ser Gin Gly 
35 Gin 

130 

age ccc gtg gcg 
ctg 480 
40 Ser Pro Val Ala 
Leu 

145 
160 

45 ccc etc gtg teg 
gtg 528 

Pro Leu Val Ser 
Val 

50 

cag tgc atg ccc 
gag 576 



tec tac ttc egg gac 
Ser Tyr Phe Arg Asp 
55 

ctg ccg gcg get gtg 
Leu Pro Ala Ala Val 
70 

tgc tac acg ggc egg 
Cys Tyr Thr Gly Arg 
85 

atg tac acg get ggc 
Met Tyr Thr Ala Gly 
105 

ace gag ttc ttc etc 
Thr Glu Phe Phe Leu 
120 

ctg cat gcg gag gag 
Leu His Ala Glu Glu 
135 

cag aca teg ggc tgg 
Gin Thr Ser Gly Trp 
150 

c gg gtg aa g a °g g a g 

Arg Val Lys Thr Glu 
165 

gtg gee aag egg ctg 
32 
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ctg ttc aac aac age cgc 

Leu Phe Asn Asn Ser Arg 

60 

cag ccc cag tct ttc cag 
Gin Pro Gin Ser Phe Gin 
75 

ctg age atg aac gtg ggc 
Leu Ser Met Asn Val Gly 
90 95 
ttc ctg cag ate cag gag 
Phe Leu Gin He Gin Glu 
110 

aa g gtg age tec ccg age 
Lys Val Ser Ser Pro Ser 
125 

gee cca teg teg gag ccc 
Ala Pro Ser Ser Glu Pro 
140 

cca gec tgt age ace ccg 
Pro Ala Cys Ser Thr Pro 
155 

cag cag gag teg gac tec 
Gin Gin Glu Ser Asp Ser 
170 175 
tgg gac agt ggc cag aag 
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Gin Cys Met Pro Val Ala Lys Arg Leu Trp Asp Ser Gly Gin Lys 
Glu 

180 185 190 

get ggg ggc ggc ggc aat ggc age cgc aag atg gec aag ttc tec 
acg 624 

Ala Gly Gly Gly Gly Asn Gly Ser Arg Lys Met Ala Lys Phe Ser 
Thr 

195 200 205 



10 



ccg gac ctg get gee aac egg cct cac cag ccc ccg cca ccc caa 
cag 672 

Pro Asp Leu Ala Ala Asn Arg Pro His Gin Pro Pro Pro Pro Gin 
Gin 

15 210 215 220 

get ccg gtg gtg gca gca gee cag ccc gee gtg get gcg gga gca 
ggg 720 

Ala Pro Val Val Ala Ala Ala Gin Pro Ala Val Ala Ala Gly Ala 
20 Gly 

225 230 235 

240 

cag cca gec ggt ggg gtg gca gca gca ggg ggt gtg gtg agt ggg 
25 ccc 768 

Gin Pro Ala Gly Gly Val Ala Ala Ala Gly Gly Val Val Ser Gly 
Pro 

245 250 255 

30 age acg teg gag egg acc age cca ggc ace tea age gec tac acc 

age 816 

Ser Thr Ser Glu Arg Thr Ser Pro Gly Thr Ser Ser Ala Tyr Thr 
Ser 

260 265 270 

35 

gac age cct ggc tec tac cac aat gag gag gac gag gag gag gat 

ggt 864 

Asp Ser Pro Gly Ser Tyr His Asn Glu Glu Asp Glu Glu Glu Asp 
Gly 

40 275 280 285 

ggc gag gag ggc atg gat gag cag tac egg cag ate tgc aac atg 
tac 912 

Gly Glu Glu Gly Met Asp Glu Gin Tyr Arg Gin lie Cys Asn Met 
45 Tyr 

290 295 300 

acc atg tac age atg atg aac gtc ggc cag aca gec gag aag gtg 
gag 960 

50 Thr Met Tyr Ser Met Met Asn Val Gly Gin Thr Ala Glu Lys Val 
Glu 

33 
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305 310 315 

320 

gcc etc ccg gag cag gta gec ccc gag tec cga aat cgc ate egg 
5 gtt 1008 

Ala Leu Pro Glu Gin Val Ala Pro Glu Ser Arg Asn Arg He Arg 
Val 

325 330 335 

10 egg caa gac ctg gcg tct etc ccg get gaa ctt ate aac cag att 
ggg 1056 

Arg Gin Asp Leu Ala Ser Leu Pro Ala Glu Leu He Asn Gin He 
Gly 

340 345 350 

15 

aac cgc tgc cac ccc aag etc tac gac gag ggc gac ccc tct gag 
aag 1104 

Asn Arg Cys His Pro Lys Leu Tyr Asp Glu Gly Asp Pro Ser Glu 
Lys 

20 355 360 365 

ctg. gag ctg gtg aca ggc acc aac gtg tac ate aca agg gcg cag 
ctg 1152 

Leu Glu Leu Val Thr Gly Thr Asn Val Tyr He Thr Arg Ala Gin 
25 Leu 

370 375 380 

atg aac tgc cac gtc age gca ggc acg egg cac aag gtc eta ctg 
egg 12 00 

30 Met Asn Cys His Val Ser Ala Gly Thr Arg His Lys Val Leu Leu 
Arg 

385 390 395 

400 

35 egg etc ctg gcc tec ttc ttt gac egg aac acg ctg gcc aac age 
tgc 1248 

Arg Leu Leu Ala Ser Phe Phe Asp Arg Asn Thr Leu Ala Asn Ser 
Cys 

405 410 415 

40 

ggc acc ggc ate cgc tct tct acc aac gat ccc cgt egg aag ccc 
ctg 1296 

Gly Thr Gly He Arg Ser Ser Thr Asn Asp Pro Arg Arg Lys Pro 
Leu 

45 420 425 430 

gac age cgc gtg etc cac get gtc aag tac tac tgc cag aac ttc 
gcc 1344 

Asp Ser Arg Val Leu His Ala Val Lys Tyr Tyr Cys Gin Asn Phe 
50 Ala 

435 440 445 



34 
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ccc aac ttc aag gag age gag atg aat gec ate gcg gec gac atg 
tgc 1392 

Pro Asn Phe Lys Glu Ser Glu Met Asn Ala lie Ala Ala Asp Met 
Cys 

5 450 455 460 

acc aac gec cgc cgc gtc gtg cgc aag age tgg atg ccc aag gtc 
aag 1440 

Thr Asn Ala Arg Arg Val Val Arg Lys Ser Trp Met Pro Lys Val 
10 Lys 

465 470 475 

480 

gtg etc aag get gag gat gac gec tac acc acc ttc ate agt gaa 
15 acg 1488 

Val Leu Lys Ala Glu Asp Asp Ala Tyr Thr Thr Phe He Ser Glu 
Thr 

485 490 495 

20 ggc aag ate gag ccg gac atg atg ggt gtg gag cat ggc ttc gag 
acc 1536 

Gly Lys He Glu Pro Asp Met Met Gly Val Glu His Gly Phe Glu 
Thr 

500 505 510 

25 

gee age cac gag ggc gag gcg ggt ccc teg get gaa gee ctg cag 
taa 1584 

Ala Ser His Glu Gly Glu Ala Gly Pro Ser Ala Glu Ala Leu Gin 
515 520 525 

30 

cccgcccagc ctcccgcggg gccgcacact tcccctccca acacacacac 
acacctgcca 1644 

tcttggtcat gagctactgt ctgtccctcc ccaggacccg cggtgggtgc 
35 tgcatgttcc 1704 

cggccctctg cccctcctgt cctaccccct ttccccaccg agagctgggc 
egggagagga 1764 

40 ccgcagggca ggtggcgtga ggtccgtgtt gecttcttta acacacactc 
gtgcagtggg 1824 

ggagttctgg ctccccaacc taacccctag ccgtcatctc cacctaaaaa aaaaaaa 
1881 

45 



<210> 6 
50 <211> 2889 

<212> DNA 
<213> Homo sapiens 

35 



WO 02/077180 



PCT/US02/08964 



<220> 

<221> CDS 

<222> (14) . . (1744) 

<400> 6 

ctgtcctgga aag atg eta gca atg ggg gcg ctg gca gga ttc tgg 



ate 4 9 

10 He 



Met Leu Ala Met Gly Ala Leu Ala Gly Phe Trp 
1-5 10 



etc tgc etc etc act tat ggt tac ctg tec tgg ggc cag gec tta 
gaa 97 

15 Leu Cys Leu Leu Thr Tyr Gly Tyr Leu Ser Trp Gly Gin Ala Leu 
Glu 

15 20 25 

gag gag gaa gaa ggg gec tta eta get caa get gga gag aaa eta 
20 gag 145 

Glu Glu Glu Glu Gly Ala Leu Leu Ala Gin Ala Gly Glu Lys Leu 
Glu 

30 35 40 

25 ccc age aca act tec acc tec cag ccc cat etc att ttc ate eta 
gcg 193 

Pro Ser Thr Thr Ser Thr Ser Gin Pro His Leu He Phe He Leu 
Ala 

45 50 55 

30 60 

gat gat cag gga ttt aga gat gtg ggt tac cac gga tct gag att 
aaa 241 

Asp Asp Gin Gly Phe Arg Asp Val Gly Tyr His Gly Ser Glu He 
35 Lys 

65 70 75 

aca cct act ctt gac aag etc get gee gaa gga gtt aaa ctg gag 
aac 289 

40 Thr Pro Thr Leu Asp Lys Leu Ala Ala Glu Gly Val Lys Leu Glu 
Asn 

80 85 90 

tac tat gtc cag cct att tgc aca cca tec agg agt cag ttt att 
45 act 337 

Tyr Tyr Val Gin Pro lie Cys Thr Pro Ser Arg Ser Gin Phe lie 
Thr 

95 100 105 

50 gga aag tat cag ata cac acc gga ctt caa cat tct ate ata aga 
cct 385 

36 
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Gly Lys Tyr Gin lie His Thr Gly Leu Gin His Ser lie lie Arg 
Pro 

110 115 120 

5 acc caa ccc aac tgt tta cct ctg gac aat gcc acc eta cct cag 
aaa 433 

Thr Gin Pro Asn Cys Leu Pro Leu Asp Asn Ala Thr Leu Pro Gin 
Lys 

125 130 135 

10 140 

ctg aag gag gtt gga tat tea acg cat atg gtc gga aaa tgg cac 
ttg 481 

Leu Lys Glu Val Gly Tyr Ser Thr His Met Val Gly Lys Trp His 
15 Leu 

145 150 155 

ggt ttt tac aga aaa gaa tgc atg ccc acc aga aga gga ttt gat 
acc 529 

20 Gly Phe Tyr Arg Lys Glu Cys Met Pro Thr Arg Arg Gly Phe Asp 
Thr 

160 165 170 

ttt ttt ggt tec ctt ttg gga agt ggg gat tac tat aca cac tac 
25 aaa 577 

Phe Phe Gly Ser Leu Leu Gly Ser Gly Asp Tyr Tyr Thr His Tyr 
Lys 

175 180 185 

30 tgt gac agt cct ggg atg tgt ggc tat gac ttg tat gaa aac gac 
aat 625 

Cys Asp Ser Pro Gly Met Cys Gly Tyr Asp Leu Tyr Glu Asn Asp 
Asn 

190 195 200 

35 

get gcc tgg gac tat gac aat ggc ata tac tec aca cag atg tac 
act 673 

Ala Ala Trp Asp Tyr Asp Asn Gly lie Tyr Ser Thr Gin Met Tyr 
Thr 

40 205 210 215 

220 

cag aga gta cag caa ate tta get tec cat aac ccc aca aag cct 
ata 721 

45 Gin Arg Val Gin Gin lie Leu Ala Ser His Asn Pro Thr Lys Pro 
He 

225 230 235 

ttt tta tat att gcc tat caa get gtt cat tea cca ctg caa get 
50 cct 769 

Phe Leu Tyr He Ala Tyr Gin Ala Val His Ser Pro Leu Gin Ala 
Pro 

37 
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240 

ggc agg tat ttc 
agg 817 
5 Gly Arg Tyr Phe 
Arg 

255 

aga tat get gec 
10 gtg 865 

Arg Tyr Ala Ala 
Val 

270 

15 aca ttg get eta 
att 913 

Thr Leu Ala Leu 
He 
285 
20 300 

tac tct tea gat 
tgg 961 
Tyr Ser Ser Asp 
25 Trp 



cct etc aga ggt 
get 1009 
30 Pro Leu Arg Gly 
Ala 

320 

gta ggc ttt gtg 
35 tgt 1057 

Val Gly Phe Val 
Cys 

335 

40 aag gaa ctt gtg 
ctg 1105 

Lys Glu Leu Val 
Leu 

350 

45 

get gaa gga cag 
ate ~ 1153 

Ala Glu Gly Gin 
He 
50 365 
380 



245 

gaa cac tac cga tec 
Glu His Tyr Arg Ser 
260 

atg ctt tec tgc tta 
Met Leu Ser Cys Leu 
275 

aag act tat ggt ttc 
Lys Thr Tyr Gly Phe 
290 

aat ggt ggc cag cct 
Asn Gly Gly Gin Pro 
305 

age aaa gga aca tat 
Ser Lys Gly Thr Tyr 
325 

cat age cca ctt ctg 
His Ser Pro Leu Leu 
340 

cac ate act gac tgg 
His He Thr Asp Trp 
355 

att gat gag gac att 
He Asp Glu Asp He 
370 
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250 

att ate aac ata aac agg 
He He Asn lie Asn Arg 
265 

gat gaa gca ate aac aac 
Asp Glu Ala lie Asn Asn 
280 

tat aac aac age att ate 
Tyr Asn Asn Ser He He 
295 

acg gca gga ggg agt aac 
Thr Ala Gly Gly Ser Asn 
310 315 
tgg gaa gga ggg ate egg 
Trp Glu Gly Gly He Arg 
330 

aaa aac aag gga aca gtg 
Lys Asn Lys Gly Thr Val 
345 

tac ccc act etc att tea 
Tyr Pro Thr Leu He Ser 
360 

caa eta gat ggc tat gat 
Gin Leu Asp Gly Tyr Asp 
375 



38 
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tgg gag acc ata 
ttg 1201 

Trp Glu Thr lie 
Leu 

5 

cat aac att gac 
gca 1249 
His Asn lie Asp 
10 Ala 

400 

gca ggc tat ggg 
gtg 1297 
15 Ala Gly Tyr Gly 
Val 

415 

cag cac tgg aaa 
20 gtc 1345 

Gin His Trp Lys 
Val 

430 

25 ccc cct cag tct 
gaa 1393 

Pro Pro Gin Ser 
Glu 
445 
30 460 

egg ate acc ttg 
aca 1441 
Arg lie Thr Leu 
35 Thr 



gec gac cca tat 
ate 1489 
40 Ala Asp Pro Tyr 
He 

480 

gtg aag aag etc 
45 gtg 1537 

Val Lys Lys Leu 
Val 

495 

50 ccg gtc agg tat 
aat 1585 



agt gag ggt ctt cgc 
Ser Glu Gly Leu Arg 
385 

ccc ata tac acc aag 
Pro He Tyr Thr Lys 
405 

ate tgg aac act gca 
He Trp Asn Thr Ala 
420 

ttg ctt aca gga aat 
Leu Leu Thr Gly Asn 
435 

ttc age aac ctg gga 
Phe Ser Asn Leu Gly 
450 

tea act ggc aaa agt 
Ser Thr Gly Lys Ser 
465 

gag agg gtg gac eta 
Glu Arg Val Asp Leu 
485 

eta egg agg etc tea 
Leu Arg Arg Leu Ser 
500 

ccc ccc aaa gac ccc 
39 
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tea ccc cga gta gat att 

Ser Pro Arg Val Asp He 

390 395 

gca aaa aat ggc tec tgg 

Ala Lys Asn Gly Ser Trp 

410 

ate cag tea gee ate aga 
He Gin Ser Ala He Arg 
425 

cct ggc tac age gac tgg 
Pro Gly Tyr Ser Asp Trp 
440 

ccg aac egg tgg cac aat 
Pro Asn Arg Trp His Asn 
455 

gta tgg ctt ttc aac ate 
Val Trp Leu Phe Asn He 
470 475 
tct aac agg tat cca gga 
Ser Asn Arg Tyr Pro Gly 
490 

cag ttc aac aaa act gca 
Gin Phe Asn Lys Thr Ala 
505 

aga agt aac cct agg etc 
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Pro Val Arg Tyr Pro Pro Lys Asp Pro Arg Ser Asn Pro Arg Leu 
Asn 

510 515 520 

5 gga ggg gtc tgg gga cca tgg tat aaa gag gaa acc aag aaa aag 
aag 1633 

Gly Gly Val Trp Gly Pro Trp Tyr Lys Glu Glu Thr Lys Lys Lys 
Lys 

525 530 535 

10 540 

cca age aaa aat cag get gag aaa aag caa aag aaa age aaa aaa 
aag 1681 

Pro Ser Lys Asn Gin Ala Glu Lys Lys Gin Lys Lys Ser Lys Lys 
15 Lys 

545 550 555 

aag aag aaa cag cag aaa gca gtc tea ggt tea act tgc cat tea 
ggt 1729 

20 Lys Lys Lys Gin Gin Lys Ala Val Ser Gly Ser Thr Cys His Ser 
Gly 

560 565 570 

gtt act tgt gga taa gcacaaatat ttcctgtttg gttaaacttt 
25 aatcagttct 1784 
Val Thr Cys Gly 
575 

tatctttcat ctgtttccta ggtaaaccag caaatttggc tcgataatat 
30 cgctggccta 1844 

agegtcagge ttgttttcat gctgtgccac tccagagact tctgccacct 
ggccgccaca 1904 

35 ctgaaaactg tcctgctcag tgccaaggtg ctactcttgc aagccacact 
tagagagagt 19 64 



40 



ggagatgttt atttctctcg ctcctttaga aaacgtggtg agtcctgagt 

tccactgctg 2024 

tgcttcagtc aactgaccaa acactgettt gaattatagg aggagaacaa 

taacctacca 2084 



tccgcaagca tgctaatttg atggaagtta cagggtagca tgattaaaac 
45 tacctttgat 2144 

aaattacagt caaagattgt gtcacctcaa aggecttgaa gaatatattt 
tcttggtgaa 2204 

50 tttttgtatg tctgtcatat gacacttggg ttttttaatt aattctattt 
tatatatata 2264 

40 
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aatatatgtt tcttttcctg tgaaaagctg tttttctcac atgtgaacag 
cttgcacctc 2324 

attttaccat gcgtgaggga atggcaaata agaatgtttg agcacactgc 
5 ccacaatgaa 2384 

tgtaactatt ttctaaacac tttactagaa gaacatttca gtataaaaaa 
cctaatttat 2444 

10 ttttacagaa aaatattttg ttgtttttat aaaaagttat gcaaatgact 
tttattttta 2504 

tttcctgcat accattagaa gaattttatt tcatttcttc aaattatcaa 

gcactgtaat 2564 

15 

acctataaat taatgtaata ctgtgtgaat tcagactatt aaaaacatca 

ttcagaaaac 2624 

tttataatcg tcattgttca atcaagattt tgaatgtaat aagatgaata 
20 tattccttac 2684 

aaattacttg gaaattcaat gtttgtgcag agttgagaca actttattgt 
ttctatcata 2744 

25 aactatttat gtatcttaat tattaaaatg atttacttta tggcactaga 
aaatttactg 2804 

tggcttttct gatctaactt ctagctaaaa ttgtatcatt ggccctaaaa 
aataaaaatc 2864 

30 

tttactaata ggcaaaaaaa aaaaa 
2889 



35 

<210> 7 

<211> 1356 

<212> DNA 

40 <213> Homo sapiens 

<220> 

<221> CDS 

<222> (51) . . (1061) 

45 

<400> 7 

cggacgcgtg ggccgggggt tcgcccgcgg aggccgggga gcagccgacc 
gag 56 

50 Glu 



atg 
Met 
1 



41 
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ccc cag aaa ate 
ggc 104 

Pro Gin Lys lie 
Gly 

5 5 

aaa gtg act cat 
aaa 152 
Lys Val Thr His 
10 Lys 

20 

gec aaa cgc caa 
gaa 200 
15 Ala Lys Arg Gin 
Glu 
35 

50 

20 gaa gaa gac ccc 
tat 248 

Glu Glu Asp Pro 
Tyr 

25 

gca acg aag cat 
aga 296 

Ala Thr Lys His 
Arg 

30 ~ 70 

ttt cgc ctg aag 
cca 344 
Phe Arg Leu Lys 
35 Pro 

85 

etc gtc tea gtg 
ttg 392 
40 Leu Val Ser Val 
Leu 

100 

cac egg get gtc 
45 ctt 440 

His Arg Ala Val 
Leu 

115 
130 

50 

caa gga ggc cgt 
acc 488 



atg cca ccc tea aag cct 
Met Pro Pro Ser Lys Pro 
10 

cac age att gaa tta tac 
His Ser lie Glu Leu Tyr 
25 

gga cct caa gag cag tgg 
Gly Pro Gin Glu Gin Trp 
40 

aaa atg cac act tat ggt 
Lys Met His Thr Tyr Gly 
55 60 
gtt gtt gaa ggt ctg gaa 
Val Val Glu Gly Leu Glu 

75 

gtc acc age ccc tct ggg 
Val Thr Ser Pro Ser Gly 
90 

tct aca acc aga gag ccc 
Ser Thr Thr Arg Glu Pro 
105 

a £t gtg aat 9 at 9 aa 9 at 
Ser Val Asn Asp Glu Asp 
120 

gtt aag gtt gat gtt ccc 

42 
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cat cca cct gtc gtg 

His Pro Pro Val Val 

15 

tgg gait ctg gaa aag 
Trp Asp Leu Glu Lys 
30 

ttc agg ttc teg att 
Phe Arg Phe Ser lie 
45 

ate att tat acg gga 
lie lie Tyr Thr Gly 

65 

cca agg acg ctg tac 
Pro Arg Thr Leu Tyr 
80 

g a 9 tgt gag tac age 
Glu Cys Glu Tyr Ser 
95 

ata agt agt gag cac 
lie Ser Ser Glu His 
110 

ttg ctg gtc cga ata 
Leu Leu Val Arg lie 
125 

aat aag ttt ggc ttt 
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Gin Gly Gly Arg Val Lys Val Asp Val Pro Asn Lys Phe Gly Phe 
Thr 

135 140 145 

get ctg atg gtt get gee cag aaa gga tac ace agg ctt gtg aaa 
ate 536 

Ala Leu Met Val Ala Ala Gin Lys Gly Tyr Thr Arg Leu Val Lys 
lie 

150 155 160 



10 



eta gtt tct aat ggc aca gac gtg aat ctg aag aat gga agt ggc 
aag 584 

Leu Val Ser Asn Gly Thr Asp Val Asn Leu Lys Asn Gly Ser Gly 
Lys 

15 165 170 175 

gac agt eta atg ctg gcg tgc tat gcg gga cac eta gat gtt gtg 
aaa 632 

Asp Ser Leu Met Leu Ala Cys Tyr Ala Gly His Leu Asp Val Val 
20 Lys 

180 185 190 

tat etc cga aga cat ggc get tct tgg cag get aga gac ctg gga 
ggc 680 

25 Tyr Leu Arg Arg His Gly Ala Ser Trp Gin Ala Arg Asp Leu Gly 
Gly 

195 200 205 

210 

30 tgt aca get ctg cac tgg get gca gat gga ggc cac tgc agt gtg 
att 728 

Cys Thr Ala Leu His Trp Ala Ala Asp Gly Gly His Cys Ser Val 
He 

215 220 225 

35 

gag tgg atg ata aag gat ggc tgt gag gta gac gtc gtg gac act 
ggt 776 

Glu Trp Met He Lys Asp Gly Cys Glu Val Asp Val Val Asp Thr 
Gly 

40 230 235 240 

tea gga tgg acc cca etc atg aga gtc tct gcg gtg teg gga aat 
cag 824 

Ser Gly Trp Thr Pro Leu Met Arg Val Ser Ala Val Ser Gly Asn 
45 Gin 

245 250 255 

agg gtg gee tct ctt eta att gat get ggg gee aat gtg aat gtg 
aag 872 

50 Arg Val Ala Ser Leu Leu He Asp Ala Gly Ala Asn Val Asn Val 
Lys 

260 265 270 

43 
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gac aga aat gga aag acg ccc ctt atg gtg get gtg tta aat aat 
cat 920 

Asp Arg Asn Gly Lys Thr Pro Leu Met Val Ala Val Leu Asn Asn 
5 His 

275 280 285 

290 

gaa gag tta gtt cag tta ctt ctt gac aaa ggg gca gat gca agt 
10 gta 968 

Glu Glu Leu Val Gin Leu Leu Leu Asp Lys Gly Ala Asp Ala Ser 
Val 

295 300 305 

15 aaa aat gag ttc ggc aaa ggt gtc eta gaa atg gec aga gtt ttt 
gac 1016 

Lys Asn Glu Phe Gly Lys Gly Val Leu Glu Met Ala Arg Val Phe 
Asp 

310 315 320 



20 



25 



aga cag gtt ggg atg etc ttt ctg cct ate aag get aat ttt tag 
tec 1064 

Arg Gin Val Gly Met Leu Phe Leu Pro lie Lys Ala Asn Phe 
325 " 330 335 

ttctagactc agaaaaacca ccaaaaccct ggattacatt cgcctgtctg 
ttttgtttat 1124 



ctagagtgta gtctccttat tagaagaaag gaaaaaaaag cagaggecaa 
30 agaagtcttg 1184 

tgtctgctga tgagagcacc actcatctgc gaaacgcacg taaaacaaag 
tgaaccgtga 1244 

35 ctgttaaact agggatggga aattctgeat cttggggggc tgtacattta 
tttatttagt 1304 



40 



tgaagattca ctgatcccac tttgaaatac atctttttac ctaaaaaaaa aa 
1356 



<210> 8 

45 <211> 2124 

<212> DNA 

<213> Homo sapiens 

<220> 

50 <221> CDS 

<222> (1) . . (1866) 



44 
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<400> 8 

atg gcg ctg ggc ttg gag cag gcg gag gag cag egg ttg tac cag 
cag 48 

Met Ala Leu Gly Leu Glu Gin Ala Glu Glu Gin Arg Leu Tyr Gin 
5 Gin 

15 10 15 

acg etc ctg caa gac ggg etc aaa gac atg ctg gac cat ggc aag 
ttc 96 

10 Thr Leu Leu Gin Asp Gly Leu Lys Asp Met Leu Asp His Gly Lys 
Phe 

20 25 30 

etc gac tgt gtg gtg egg gcg ggc gag cgc gag ttc ccg tgc cat 
15 cgc 144 

Leu Asp Cys Val Val Arg Ala Gly Glu Arg Glu Phe Pro Cys His 
Arg 

35 40 45 

20 ctg gtg ctg gee gee tgc age ccc tac ttc egg gcg cgc ttt eta 
gee 192 

Leu Val Leu Ala Ala Cys Ser Pro Tyr Phe Arg Ala Arg Phe Leu 
Ala 

50 55 60 

25 

gag ccg gag cgc gcg ggc gag ctg cac ctg gag gag gtg tec ccg 
gac 240 

Glu Pro Glu Arg Ala Gly Glu Leu His Leu Glu Glu Val Ser Pro 
Asp 

30 65 70 75 

80 

gtg gtg gec cag gtg ctg cac tac ctg tac aca tea gag ate gcg 
ctg 288 

35 Val Val Ala Gin Val Leu His Tyr Leu Tyr Thr Ser Glu lie Ala 
Leu 

85 90 95 

gat gag gcg age gtg cag gat ttg ttc gee gcg gca cac cgc ttc 
40 cag 33 6 

Asp Glu Ala Ser Val Gin Asp Leu Phe Ala Ala Ala His Arg Phe 
Gin 

100 105 110 

45 ate cct tec ate ttc acc ate tgc gtg tec ttc ctg cag aag cgc 
ctg 384 

lie Pro Ser lie Phe Thr lie Cys Val Ser Phe Leu Gin Lys Arg 
Leu 

115 120 125 

50 

tgc etc tec aac tgc ttg gee gtc ttc cgt etc ggc etc ctg etc 
gac 432 

45 
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Cys Leu Ser Asn Cys Leu Ala Val Phe Arg Leu Gly Leu Leu Leu 
Asp 

130 135 140 

5 tgc gcg cgt etc gec gtg get gec cgc gac ttc ate tgc get cac 
ttc 480 

Cys Ala Arg Leu Ala Val Ala Ala Arg Asp Phe He Cys Ala His 
Phe 

145 150 155 

10 160 

acg ctg gtg gcg cgc gac get gac ttc etc gga etc teg gec gac 
gag 528 

Thr Leu Val Ala Arg Asp Ala Asp Phe Leu Gly Leu Ser Ala Asp 
15 Glu 

165 170 175 

etc ate gec ate ate tec age gac ggc ctt aac gtg gag aag gag 
gag 576 

20 Leu He Ala He He Ser Ser Asp Gly Leu Asn Val Glu Lys Glu 
Glu 

180 185 190 

gca gtg ttc gag gcg gtg atg egg tgg gcg ggt age ggc gac gec 
25 gag 624 

Ala Val Phe Glu Ala Val Met Arg Trp Ala Gly Ser Gly Asp Ala 
Glu 

195 200 205 

30 gcg cag get gag cgc cag cgc gcg ctg ccc acc gtc ttc gag age 
gtg 672 

Ala Gin Ala Glu Arg Gin Arg Ala Leu Pro Thr Val Phe Glu Ser 
Val 

210 215 220 

35 

cgc tgc cgc ttg ctg ccg cgc gee ttt ctg gaa age cgc gtg gag 
cgc 720 

Arg Cys Arg Leu Leu Pro Arg Ala Phe Leu Glu Ser Arg Val Glu 
Arg 

40 225 230 235 

240 

cac cct etc gtg cgt gee cag ccc gag ttg ctg cgc aag gtg cag 
atg 768 

45 His Pro Leu Val Arg Ala Gin Pro Glu Leu Leu Arg Lys Val Gin 
Met 

245 250 255 

gtg aag gat gca cac gag ggc cgc ate acc acg ctg egg aag aaa 
50 aag 816 

Val Lys Asp Ala His Glu Gly Arg He Thr Thr Leu Arg Lys Lys 
Lys 
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260 

aa 9 999 aa 9 9 at 
age 864 
5 Lys Gly Lys Asp 
Ser 

275 

aaa gec aaa gca 
10 ggg 912 

Lys Ala Lys Ala 
Gly 

290 

15 ate etc aat gac 
ate 960 

lie Leu Asn Asp 
He 
305 
20 320 

ttc atg ate agt 
aac 1008 
Phe Met He Ser 
25 Asn 



gag tgc tac tgt 
gtc 1056 
30 Glu Cys Tyr Cys 
Val 

340 

age ctg gtt ace 
35 ttc 1104 

Ser Leu Val Thr 
Phe 

355 

40 tac aac gaa gac 
cag 1152 

Tyr Asn Glu Asp 
Gin 

370 

45 

ttt gac cat ctg 
teg 1200 

Phe Asp His Leu 
Ser 
50 385 
400 



265 

gg a gec ggg gee aag 
Gly Ala Gly Ala Lys 
280 

gag gag gat gag gag 
Glu Glu Asp Glu Glu 
295 

acc ctg cgc ttc ggc 
Thr Leu Arg Phe Gly 
310 

gag gag ggc get gtg 
Glu Glu Gly Ala Val 
325 

get tec etc tec aac 
Ala Ser Leu Ser Asn 
345 

aag gag aac cag gtc 
Lys Glu Asn Gin Val 
360 

aac aaa gag gac ccc 
Asn Lys Glu Asp Pro 
375 

gac tea gag tgg ctg 
Asp Ser Glu Trp Leu 
390 



PCT/US02/08964 
270 

gag get gat aag ggc aca 
Glu Ala Asp Lys Gly Thr 
285 

gee gaa cgt ate ctt cct 
Ala Glu Arg He Leu Pro 
300 

atg ttc ctg cag gat etc 
Met Phe Leu Gin Asp Leu 
315 

gee tac gat cca gca gee 
Ala Tyr Asp Pro Ala Ala 
330 335 
cag gtc ccc aag aac cac 
Gin Val Pro Lys Asn His 
350 

ttc gtg get gga ggc etc 
Phe Val Ala Gly Gly Leu 
365 

atg age gca tac ttc ctg 
Met Ser Ala Tyr Phe Leu 
380 

ggg atg cca ccg ctg ccc 
Gly Met Pro Pro Leu Pro 
395 
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ccc cgc tgc 
gtg 1248 

Pro Arg Cys 
Val 

5 

gtc ggt ggc 
gtc 1296 
Val Gly Gly 
10 Val 



atg tgc tac 
ctg 1344 
15 Met Cys Tyr 
Leu 

435 

cct tac gtg 
20 gtc 1392 
Pro Tyr Val 
Val 

450 

25 tac gta att 
atg 1440 
Tyr Val lie 
Met 
465 
30 480 

tgc gtc tat 

atg 1488 

Cys Val Tyr 
35 Met 



cag acc gcc 

att 1536 

40 Gin Thr Ala 
He 



ate gtg gca 

45 gaa 1584 

He Val Ala 
Glu 

515 

50 gtg tac age 

cca 1632 



etc ttt ggc ctg 

Leu Phe Gly Leu 
405 

aga gag ate aag 

Arg Glu He Lys 
420 

gac agg ctg tea 

Asp Arg Leu Ser 

gtg tat ggc cac 
Val Tyr Gly His 
455 

ggc ggc aaa ggc 
Gly Gly Lys Gly 
470 

gac ccc aag aag 
Asp Pro Lys Lys 
485 

cgc tea etc ttt 
Arg Ser Leu Phe 
500 

get ggg gtc acc 
Ala Gly Val Thr 

ate aca gac aac 



gga gaa get etc 
Gly Glu Ala Leu 
410 

gac ggc gag cgc 
Asp Gly Glu Arg 
425 

ttc aaa tgg ggt 
Phe Lys Trp Gly 
440 

aca gtg etc tec 
Thr Val Leu Ser 

agt gac agg aag 
Ser Asp Arg Lys 
475 

ttt gag tgg aag 
Phe Glu Trp Lys 
490 

999 gcc act gtc 
Gly Ala Thr Val 
505 

gac aca ggg ctg 
Asp Thr Gly Leu 
520 

aag tgg gca ccc 

48 
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aac tec ate tac 

Asn Ser He Tyr 

415 

tgc ctg gac teg 
Cys Leu Asp Ser 
430 

gaa teg gac ccg 
Glu Ser Asp Pro 
445 

cac atg gac ctt 
His Met Asp Leu 
460 

tgc ctg aac aag 
Cys Leu Asn Lys 

gag ctg gca ccc 
Glu Leu Ala Pro 
495 

cat gat ggc cgc 
His Asp Gly Arg 
510 

acc agt tct gcc 
Thr Ser Ser Ala 
525 

ttc gag gcc ttc 
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Val Tyr Ser lie Thr Asp Asn Lys Trp Ala Pro Phe Glu Ala Phe 
Pro 

530 535 540 

5 cag gag cgt age tea etc age ctg gtc age ctg gtg ggt ace etc 
tat 1680 

Gin Glu Arg Ser Ser Leu Ser Leu Val Ser Leu Val Gly Thr Leu 
Tyr 

545 550 555 

10 560 

gec att ggt ggc ttt gee aca ctg gag acg gag tct gga gag ctg 
gtt 1728 

Ala lie Gly Gly Phe Ala Thr Leu Glu Thr Glu Ser Gly Glu Leu 
15 Val 

565 570 575 

ccc aca gag etc aat gac ate tgg agg tat aac gag gag gag aag 
aaa 1776 

20 Pro Thr Glu Leu Asn Asp lie Trp Arg Tyr Asn Glu Glu Glu Lys 
Lys 

580 585 590 

tgg gag ggt gtc ctg egg gag ate gec tat gca gca ggt gee ace 
25 ttc 1824 

Trp Glu Gly Val Leu Arg Glu He Ala Tyr Ala Ala Gly Ala Thr 
Phe 

595 600 605 

30 eta cca gtg egg etc aat gtg ctg cgc ctg act aag atg tga 
ccagctc 1873 

Leu Pro Val Arg Leu Asn Val Leu Arg Leu Thr Lys Met 

610 615 620 

35 aggcagactg aactaagcac ccctcccatc ctgcgaccct cactggcctg 
gccttgtggg 1933 



40 



ggctccagaa aagaggctag gagaggecag agtctacctg gatccagtta 
tggtgcctca 1993 

ggggctgcgt cagecaagga aagggaagtg ctgcttagtc ctggactttt 
gggccagggt 2053 



gagaaactag aggcttctcc agtgttgcca tatcccccta ggttgtcttg 
45 atccatgaac 2113 

cagaaccaca g 
2124 

50 
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<210> 9 
<211> 4414 
<212> DNA 

<213> Homo sapiens 

5 

<220> 
<221> CDS 

<222> (991) . . (2400) 

10 <400> 9 

gtcacacatg gacagcctca tctattttat tgacccattt ccagatgctg 
ggattaccga 60 

gttacattaa tacacagctt tttatggtat acagtaatgt aatatattag 
15 cctaatgtgt 120 

taggactcct taaacaaggg atactgtaat cattaataca gttgatccat 
taggcttggc 180 

20 acagtgttta ttttatggaa acactctaaa agaaaaacag acctgaagtt 
catcatgtgt 240 

aaggttacac aaatctatct ttcaaatgag ttcattactc taaataattg 
tcagaagtta 300 

25 

catttttgtt taatataatc tttgcagtaa ccttccaaat tagggcaatt 
tttctgtcca 360 

gatctgccta ctgaactaga atatttaatg agcaacaact tctgttgtga 
30 attaacggac 420 

tgatattggg gtttgataaa tcagtttcct aaggtttgga taagccaaga 
gttagactct 480 

35 aagaatataa aaaggcctag ataacatcat aaaagtaagg ttctagatcc 
acaaagcaga 540 

gaactagacc cagattataa gtcactccag aggtcccatt tttctggatt 
tgagggagga 600 

40 

agtgtttgat gacagtattt gaagtgattt tatatccatt ttcttttaaa 
aagtaatagt 660 

ttggaattag ttacttgtta attaatctgt aaaatatttg ctgtggtgtt 
45 ataatgtgaa 720 

aatgtggttg ctaattttaa aataaatttt gggtttttta aagtaaattt 
ttaaaaacac 780 

50 tttttatgta tatcacagcc tcaacaagaa cttgatttaa actctccacg 
aaataccact 840 

50 



WO 02/077180 PCT/US02/08964 

ttggaaagac agactttcta ttgtgttccg gtgcctgggg aatctacgtg 
ggtaaaagaa 900 

gcctatgtta atgcaaacca agctcgagtc agtccctcaa catcctacac 
5 tcctagtcgc 960 

cacaagagga gttatgaaga tgatgacgat atg gac eta cag ccc aat 

aag 1011 

Met Asp Leu Gin Pro Asn 

10 Lys 

1 5 

cag aaa gac caa cat gca ggt gec aga caa gca ggg agt gtt ggt 
ggt 1059 

15 Gin Lys Asp Gin His Ala Gly Ala Arg Gin Ala Gly Ser Val Gly 
Gly 

10 15 20 

ctt caa tgg tgt gga gag cca aaa cgt tta gaa act gaa get tct 
20 act 1107 

Leu Gin Trp Cys Gly Glu Pro Lys Arg Leu Glu Thr Glu Ala Ser 
Thr 

25 30 35 

25 ggg caa cag ctg aac tct ctg aac ttg tct tct cct ttt gat ttg 
aat 1155 

Gly Gin Gin Leu Asn Ser Leu Asn Leu Ser Ser Pro Phe Asp Leu 
Asn 

40 45 50 

30 55 

ttt cca ttg cca gga gag aag ggc cct gca tgc ctt gtg aag gtt 
tat 1203 

Phe Pro Leu Pro Gly Glu Lys Gly Pro Ala Cys Leu Val Lys Val 
35 Tyr 

60 65 70 

gaa gat tgg gat tgt ttc aaa gta aat gac att ctt gag eta tat 
ggc 12 51 

40 Glu Asp Trp Asp Cys Phe Lys Val Asn Asp He Leu Glu Leu Tyr 
Gly 

75 80 85 

ata ctg tct gtg gat cct gtg ctg agt ata ctg aat aat gat gaa 
45 agg 1299 

He Leu Ser Val Asp Pro Val Leu Ser He Leu Asn Asn Asp Glu 
Arg 

90 95 100 

50 gat gee tct gca ctg ctg gat ccg atg gag tgc aca gac aca gca 
gag 1347 

51 
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Asp Ala Ser Ala Leu Leu Asp Pro Met Glu Cys Thr Asp Thr Ala 
Glu 

105 110 115 

5 gag cag aga gta cac agt cct cct get tea tta gtg ccg aga att 
cat 1395 

Glu Gin Arg Val His Ser Pro Pro Ala Ser Leu Val Pro Arg lie 
His 

120 125 130 

10 135 

gtg ate tta gec cag aag ttg caa cac ate aac cca tta ttg cct 
gee 1443 

Val lie Leu Ala Gin Lys Leu Gin His lie Asn Pro Leu Leu Pro 
15 Ala 

140 145 150 

tgc ctt aac aaa gag gag age aaa acc tgt aag ttt gtt tea agt 
ttc 1491 

20 Cys Leu Asn Lys Glu Glu Ser Lys Thr Cys Lys Phe Val Ser Ser 
Phe 

155 160 165 

atg tec gaa ttg tct cca gtc aga gca gaa ctt ctt ggg ttc ctt 
25 act 1539 

Met Ser Glu Leu Ser Pro Val Arg Ala Glu Leu Leu Gly Phe Leu 
Thr 

170 175 180 

30 cat gec ctt ctg ggg gat agt ttg get get gaa tac ctt ata tta 
cat 1587 

His Ala Leu Leu Gly Asp Ser Leu Ala Ala Glu Tyr Leu lie Leu 
His 

185 190 195 

35 

etc ate tec aca gta tat aca aga aga gat gtc ctt cca eta gga 
aaa 1635 

Leu lie Ser Thr Val Tyr Thr Arg Arg Asp Val Leu Pro Leu Gly 
Lys 

40 200 205 210 

215 

ttt aca gtt aac ttg agt ggt tgc cca egg aat agt acc ttc aca 
gaa 1683 

45 Phe Thr Val Asn Leu Ser Gly Cys Pro Arg Asn Ser Thr Phe Thr 
Glu 

220 225 230 

cac ttg tat cga att att caa cat ctt gtt cca gca tct ttt cgt 
50 ctg 1731 

His Leu Tyr Arg lie lie Gin His Leu Val Pro Ala Ser Phe Arg 
Leu 

52 
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235 

cag atg act ata gag 
aaa 1779 
5 Gin Met Thr He Glu 
Lys 

250 

gac tac aca gcc aat 
10 age 1827 

Asp Tyr Thr Ala Asn 
Ser 

265 

15 aat act tec ctt gta 
ctg 1875 

Asn Thr Ser Leu Val 
Leu 
280 
20 295 

gat acc cca ggt gtt 
acg 1923 
Asp Thr Pro Gly Val 
25 Thr 

300 

tgg cag aag gtg gat 
ccc 1971 
30 Trp Gin Lys Val Asp 
Pro 

315 

tgc aat att aac gtt 
35 ccg 2019 

Cys Asn He Asn Val 
Pro 

330 

40 gca gac tgc cag att 
atg 2067 

Ala Asp Cys Gin He 
Met 

345 

45 

gag gag tac atg aac 
ctg 2115 

Glu Glu Tyr Met Asn 
Leu 
50 360 
375 



240 

aac atg aac cat ttg aaa 
Asn Met Asn His Leu Lys 
255 

cgc ttg gtc agt ggg etc 
Arg Leu Val Ser Gly Leu 
270 

ate gat gag act etc ctg 
He Asp Glu Thr Leu Leu 
285 290 

cat aat gtg aca gcc ctg 
His Asn Val Thr Ala Leu 
305 

tat gac ttc age tac cat 
Tyr Asp Phe Ser Tyr His 
320 

ttc att act teg gag ggg 
Phe lie Thr Ser Glu Gly 
335 

cac tta cag ccc cag eta 
His Leu Gin Pro Gin Leu 
350 

age ctt etc tea gcg gtg 
Ser Leu Leu Ser Ala Val 
365 370 
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245 

ttc att ccc cac 
Phe lie Pro His 
260 

etc cag ctg ccc 
Leu Gin Leu Pro 
275 

gaa cag ggg cag 
Glu Gin Gly Gin 

age aac etc ata 
Ser Asn Leu lie 
310 

cag atg gaa ttc 
Gin Met Glu Phe 
325 

agg tea etc etc 
Arg Ser Leu Leu 
340 

att cca cca aac 
He Pro Pro Asn 
355 

ctg cct tec gtg 
Leu Pro Ser Val 



53 



WO 02/077180 PCT/US02/08964 

aac aaa ttc cgc att tat eta act ctt ttg aga ttc ttg gaa tat 
age 2163 

Asn Lys Phe Arg lie Tyr Leu Thr Leu Leu Arg Phe Leu Glu Tyr 
Ser 

5 380 385 390 

ata tct gat gaa ata acc aag gca gtt gaa gat gac ttt gtg gaa 
atg 2211 

lie Ser Asp Glu lie Thr Lys Ala Val Glu Asp Asp Phe Val Glu 
10 Met 

395 400 405 

egg aag aac gac cct cag age ate act get gat gat ctt cac cag 
ctg 2259 

15 Arg Lys Asn Asp Pro Gin Ser lie Thr Ala Asp Asp Leu His Gin 
Leu 

410 415 420 

etc gtg gtg get egg tgt ctg tct etc agt get ggt cag aca acg 
20 ctg 2307 

Leu Val Val Ala Arg Cys Leu Ser Leu Ser Ala Gly Gin Thr Thr 
Leu 

425 430 435 

25 tea aga gaa cga tgg ctg aga gca aag cag eta gag tct tta aga 
aga 2355 

Ser Arg Glu Arg Trp Leu Arg Ala Lys Gin Leu Glu Ser Leu Arg 
Arg 

440 445 450 

30 455 

acg agg ctt cag cag caa aaa tgt gtg aat gga aat gaa ctt taa 
aga 2403 

Thr Arg Leu Gin Gin Gin Lys Cys Val Asn Gly Asn Glu Leu 
35 460 465 

tgtaatacct atgaagagta atgggcaaac tgtagecaca taattgtaaa 
attcagatat 2463 

40 tcatttatac cacattgttt tataggtaat ttctatcaca aaccagtgac 
atttcctgaa 2523 



45 



ateaagectg gtaacacctg atgtttatat gatattcagt aaggactttt 
accttactga 2583 

tttcatggag cttttgaagt ttgttttata ataattatat aaattagtaa 
tgatgtaaaa 2643 



aaagtatttg atattaaaag tttaatattg ataatgttgc tgattgtacc 
50 atttccttag 2703 
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cttcagctga gtcataggcc agactgttga aatgctgaaa tgaagaaggt 
tgttgcagtt 2763 

tcaaagtcag aggaatcgtg cttcggattt cttatgtttt ctagttctct 
5 gtttttccag 2823 

ttcacagtgg gttggggtgc attcagtagt ccatctttgg ggaacggagg 
cgtacttgcc 2883 

10 attgattcac atgactacat gaaattctgt actgtcattt cccagatgtt 
tggccacaga 2943 

aactttttcc cacttaacat ttgttaacag cctgcaaaac taaacttgta 
catggcagtg 3003 

15 

gttcccagac ttttgtattt tatggaccgg tagtaatatt tccaaaaatc 
tggggtacta 3063 

taaggttgcc aatttacctt gccaagtaat ccgaataaat cactgtatta 
20 tcaccatttt 3123 

tttcataaaa ggaaaggaca atctatctct gaataagagg agtcctttaa 
acggaatgaa 3183 

25 tgtggctttt gggggcaaaa gaaaccaaga cactacattg tctttatttt 
ctcctatccc 3243 

agtgcatttg agaaccatgc ataagggaat gctgtgctac aaagctgtgc 
ccaaatatga 3303 

30 

aaacaaaata ggaaacttaa aaagcaatac cccctttaga aagtttttat 
tttcttaaat 3363 

gtcattgagt tgctttgatt ctattggatt tttggcattt tttatgggat 
35 catcagttgg 3423 

ttccaagtat gttagatcag ctaacatctg ctactccagt aacagcctcg 
tacaactgca 3483 

40 ggtaggtttt ctccagacca attagtttta atagagcaaa ctaacaacag 
actgtagtag 3543 

catggttatg gcaaccagaa tcttcagaaa ggttaggaca ttacttttta 
agctgtcagt 3603 

45 

ggtatcaagt aacttaccta gttggaggca gataaaggat cccttacgtt 
ttttcctata 3663 

aggcctaaat tgaaattgtt aaccaaggaa acagggtcag ccttgaaaaa 
50 tcaaggaatt 3723 
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cattgtacct aataactgaa gtaaaaataa ctagttgttc aacttttcct 
aaactcaaat 3783 

ctatttttat aaacaaatgt aaataatgtt tatattagag ttgaactggt 
5 tttcattttt 3843 

ataactggta gactagacct tccttaaact tttagaaata aaatgaaggc 
ttcactggat 3903 

10 ttgtgaggat aaaatacatt ttctttaatt gtcctagagc aaagtacatt 
agtcaccatg 3963 

tgttttttgt gccaatgtaa attgtaattt accaaagaaa aatacataca 
ttgcttggtc 4023 

15 

ttgcagaaaa gttcccttga aagaaccttt ccaataaata aaacgtccca 
aattagcagt 4083 

accttgggct gtttttcatg agtaagaaga ttcaccatcc catgtgatct 
20 gtgtggaaaa 4143 

agaccatgtc ctcttggtgg aagacatgag agagctgaac tgaagtggag 

gaggtggtgc 4203 

25 aagagggacc ttcctgctca aggcccgccc aggcagcgga atagagtgca 
gtgcttggct 4263 

gcagaaaccc tttgtccctc acctatatat acacggacag tcaagtttgt 

tgctctaacg 4323 

30 

taaggcacag cgttaatcct gtatggccag gaaactgagt agactcctgt 

gtaaccctgt 4383 

ttggaacttt gccttcttaa aatgattttt c 
35 4414 



40 <210> 10 



<211> 1467 

<212> DNA 

<213> Homo sapiens 



45 



<220> 

<221> CDS 

<222> (46) . . (1467) 



50 



<400> 10 

gcctgccgta ccggtccgga attcccgggt cgacgatttc gtgcc 
tec 54 



atg tgc 
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Met Cys 

Ser 



ggg etc ctg gag etc ctg ctg ccc ate tgg etc tec tgg ace ctg 
ggg 102 

Gly Leu Leu Glu Leu Leu Leu Pro lie Trp Leu Ser Trp Thr Leu 
Gly 

5 10 15 



10 



ace cga ggc tct gag ccc cgc agt gtg aac gat ccc ggg aac atg 
tec 150 

Thr Arg Gly Ser Glu Pro Arg Ser Val Asn Asp Pro Gly Asn Met 
Ser 

15 20 25 30 

35 

ttt gtg aag gag acg gtg gac aag ctg ttg aaa ggc tac gac att 
cgc 198 

20 Phe Val Lys Glu Thr Val Asp Lys Leu Leu Lys Gly Tyr Asp He 
Arg 

40 45 50 

eta aga ccc gac ttc ggg ggt ccc ccg gtc tgc gtg ggg atg aac 
25 ate 246 

Leu Arg Pro Asp Phe Gly Gly Pro Pro Val Cys Val Gly Met Asn 
He 

55 60 65 

30 gac ate gee age ate gac atg gtt tec gaa gtc aac atg gat tat 
ace 294 

Asp He Ala Ser He Asp Met Val Ser Glu Val Asn Met Asp Tyr 
Thr 

70 75 80 

35 

tta ace atg tat ttt caa caa tat tgg aga gat aaa agg etc gee 
tat 342 

Leu Thr Met Tyr Phe Gin Gin Tyr Trp Arg Asp Lys Arg Leu Ala 
Tyr 

40 85 90 95 

tct ggg ate cct etc aac etc acg ctt gac aat cga gtg get gac 
cag 3 90 

Ser Gly lie Pro Leu Asn Leu Thr Leu Asp Asn Arg Val Ala Asp 
45 Gin 

100 105 110 

115 

eta tgg gtg ccc gac aca tat ttc tta aat gac aaa aag tea ttt 
50 gtg 438 

Leu Trp Val Pro Asp Thr Tyr Phe Leu Asn Asp Lys Lys Ser Phe 
Val 
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120 125 130 

cat gga gtg aca gtg aaa aac cgc atg ate cgt ctt cac cct gat 
ggg 4 86 

5 His Gly Val Thr Val Lys Asn Arg Met He Arg Leu His Pro Asp 
Gly 

135 140 145 

aca gtg ctg tat ggg etc aga ate acc acg aca gca gca tgc atg 
10 atg 534 

Thr Val Leu Tyr Gly Leu Arg He Thr Thr Thr Ala Ala Cys Met 
Met 

150 155 160 

15 gac etc agg aga tac ccc ctg gac gag cag aac tgc act ctg gaa 
att 582 

Asp Leu Arg Arg Tyr Pro Leu Asp Glu Gin Asn Cys Thr Leu Glu 
He 

165 170 175 

20 

gaa age tat ggc tac acc acg gat gac att gag ttt tac tgg cga 
ggc 630 

Glu Ser Tyr Gly Tyr Thr Thr Asp Asp He Glu Phe Tyr Trp Arg 
Gly 

25 180 185 190 

195 

ggg gac aag get gtt acc gga gtg gaa agg att gag etc ccg cag 
ttc 678 

30 Gly Asp Lys Ala Val Thr Gly Val Glu Arg lie Glu Leu Pro Gin 
Phe 

200 205 210 

tec ate gtg gag cac cgt ctg gtc teg agg aat gtt gtc ttc gec 
35 aca 726 

Ser He Val Glu His Arg Leu Val Ser Arg Asn Val Val Phe Ala 
Thr 

215 220 225 

40 ggt gec tat cct cga ctg tea ctg age ttt egg ttg aag agg aac 
att 774 

Gly Ala Tyr Pro Arg Leu Ser Leu Ser Phe Arg Leu Lys Arg Asn 
He 

230 235 240 

45 

gga tac ttc att ctt cag act tat atg ccc tct ata ctg ata acg 
att 822 

Gly Tyr Phe He Leu Gin Thr Tyr Met Pro Ser He Leu He Thr 
He 

50 245 250 255 
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ctg teg tgg gtg 
aga 870 

Leu Ser Trp Val 
Arg 
5 260 
275 

gtt gec etc ggg 
acc 918 
10 Val Ala Leu Gly 
Thr 



cac ctt egg gag 
15 gac 966 

His Leu Arg Glu 
Asp 

295 

20 atg tac ctt atg 
gag 1014 

Met Tyr Leu Met 
Glu 

310 

25 

tat gee ttt gtc 
cag 1062 

Tyr Ala Phe Val 
Gin 

30 325 

aag aag ctt gca 
aag 1110 

Lys Lys Leu Ala 
35 Lys 

340 
355 

age gaa age aac 
40 teg 1158 

Ser Glu Ser Asn 
Ser 



45 ctg gaa gtt cac 
gat 1206 

Leu Glu Val His 
Asp 

375 

50 

acc agg aat tea 
agg 1254 



tec ttc tgg ate aat 
Ser Phe Trp lie Asn 
265 

ate aca act gtg ctg 
He Thr Thr Val Leu 
280 

acc ttg ccc aaa ate 
Thr Leu Pro Lys He 
300 

ggc tgc ttc gtc ttt 
Gly Cys Phe Val Phe 
315 

aac tac att ttc ttt 
Asn Tyr He Phe Phe 
330 

gaa aag aca gec aag 
Glu Lys Thr Ala Lys 
345 

egg gtg gat get cat 
Arg Val Asp Ala His 
360 

aat gaa atg aat gag 
Asn Glu Met Asn Glu 
380 

gca ata tec ttt gac 
59 
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tat gat gca tct get get 

Tyr Asp Ala Ser Ala Ala 

270 

aca atg aca acc ate aac 
Thr Met Thr Thr He Asn 
285 290 
ccc tat gtc aaa gee att 
Pro Tyr Val Lys Ala He 
305 

gtg ttc ctg gee ctt ctg 
Val Phe Leu Ala Leu Leu 
320 

gga aga ggc cct caa agg 
Gly Arg Gly Pro Gin Arg 
335 

gca aag aat gac cgt tea 
Ala Lys Asn Asp Arg Ser 
- 350 

gga aat att ctg ttg aca 
Gly Asn He Leu Leu Thr 
365 370 
gtc tea ggc ggc att ggc 
Val Ser Gly Gly He Gly 
385 

aac tea gga ate cag tac 
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Thr Arg Asn 
Arg 

390 

5 aaa cag age 
aga 1302 
Lys Gin Ser 
Arg 

405 

10 

age etc ccg 
etc 1350 

Ser Leu Pro 
Leu 
15 420 
435 

aaa att aaa 

tgg 1398 

20 Lys lie Lys 
Trp 



tec agg ate 
25 tac 1446 
Ser Arg lie 
Tyr 



30 tgg ctg tac 
1467 
Trp Leu Tyr 
470 

35 



Ser Ala lie Ser 

atg cct cga gaa 
Met Pro Arg Glu 
410 

cac aag aag ace 
His Lys Lys Thr 
425 

ata cct gat eta 
lie Pro Asp Leu 
440 

gtg ttt cca ttc 
Val Phe Pro Phe 
455 

tat gtt aac tga 
Tyr Val Asn 



Phe Asp Asn Ser 
395 

ggg cat ggg cga 
Gly His Gly Arg 

cat eta egg agg 
His Leu Arg Arg 
430 

ace gat gtg aat 
Thr Asp Val Asn 
445 

act ttt tct ctt 
Thr Phe Ser Leu 
460 
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Gly lie Gin Tyr 

400 

ttc ctg ggg gac 
Phe Leu Gly Asp 
415 

agg tct tea cag 
Arg Ser Ser Gin 

gee ata gac aga 
Ala lie Asp Arg 
450 

ttc aac tta gtt 
Phe Asn Leu Val 
465 



<210> 11 
40 <211> 1024 

<212> DNA 
<213> Homo sapiens 



45 



<220> 

<221> CDS 

<222> (216) . . (686) 



<400> 11 

gecaagatag atcaactctc cctaaaggct gacagtgaac tettggggee 
50 gttttattct 60 
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ctgaggttag caaggagtca tctactagcc attcaggagg ccagctggga 
agacaaaata 120 

ggcaccccaa actcagcaac ttcataacac cttcctctcc ccgcctgaag 
5 ccttaaactg 180 

catcaagtca aagaaacctg gggcaaatcc ttaac atg ttt ttg act gca 
gta 233 

Met Phe Leu Thr Ala 

10 Val 

1 5 

aat cca cag cca etc tct act ccg age tgg cag att gag acc aag 
tat 281 

15 Asn Pro Gin Pro Leu Ser Thr Pro Ser Trp Gin He Glu Thr Lys 
Tyr 

10 15 20 

tea acg aaa gtg etc act gga aat tgg atg gaa gag agg aga aag 
20 ttc ~ 329 

Ser Thr Lys Val Leu Thr Gly Asn Trp Met Glu Glu Arg Arg Lys 
Phe 

25 30 35 

25 acc aga gac act gac aag aca ccc caa tec att tac aga aaa gaa 
tac 377 

Thr Arg Asp Thr Asp Lys Thr Pro Gin Ser He Tyr Arg Lys Glu 
Tyr 

40 45 50 

30 

ate ccc ttc cca gac cac aga cca gac cag ate tec agg tgg tat 
ggg 425 

He Pro Phe Pro Asp His Arg Pro Asp Gin He Ser Arg Trp Tyr 
Gly 

35 55 60 65 

70 

aag agg aaa gtt gag ggg eta cct tac aaa cac ctg ate acc ccc 
cac 473 

40 Lys Arg Lys Val Glu Gly Leu Pro Tyr Lys His Leu lie Thr Pro 
His 

75 80 85 

cag gag ccc cca cat cgt tac ttg ate age acc tat gac gac cat 
45 tac 521 

Gin Glu Pro Pro His Arg Tyr Leu He Ser Thr Tyr Asp Asp His 
Tyr 

90 95 100 

50 aac egg cat ggt tac aac ccg ggc tgc etc cac tec gca ctt gga 
atg 569 

61 
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Asn Arg His Gly Tyr Asn Pro Gly Cys Leu His Ser Ala Leu Gly 
Met 

105 110 115 

gac aga agt tgc tgt ggc tgc cag aga agt ctg act ttc ccc ttc 
ttg 617 

Asp Arg Ser Cys Cys Gly Cys Gin Arg Ser Leu Thr Phe Pro Phe 
Leu 

120 125 130 



etc ccc eta caa act atg gac tct atg age age tea age aga gac 
age 665 

Leu Pro Leu Gin Thr Met Asp Ser Met Ser Ser Ser Ser Arg Asp 
Ser 

15 135 140 145 

150 

tea cac cca agg ctg gee tga ag cagagcactt atacttcatc 
ctaccccaga 718 
20 Ser His Pro Arg Leu Ala 

155 

ccaccgttgt gcgctatgtc ctggagggag catgcggtcc cggtccctcc 
ccatcgcctg 778 



catcctctcc cacacttctg agagctgeca ccccaggagc agctcagata 
gaatcagctg 838 



gagaccacag catcactgga cttgccagac aacaagtggc gcagataaac 
30 tcagagtacg 898 

agatctggee cgtcaaaggt gctctcagaa tcatcatctg catttggegg 
tacctgtccc 958 

35 ccctcaaaac ccacaggttc ctttcttttc catccaacaa ttaaagatct 
ttgacactaa 1018 



aaaaaa 
1024 



<210> 12 
<211> 2556 
<212> PRT 
45 <213> Homo sapiens 



<400> 12 

Met Pro Pro Leu Leu Ala Pro Leu Leu Cys Leu Ala Leu Leu Pro 
50 Ala 

15 10 15 
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Leu Ala Ala Arg Gly Pro Arg Cys Ser Gin Pro Gly Glu Thr Cys 
Leu 

20 25 30 

Asn Gly Gly Lys Cys Glu Ala Ala Asn Gly Thr Glu Ala Cys Val 
5 Cys 

35 40 45 

Gly Gly Ala Phe Val Gly Pro Arg Cys Gin Asp Pro Asn Pro Cys 
Leu 

50 55 60 

10 Ser Thr Pro Cys Lys Asn Ala Gly Thr Cys His Val Val Asp Arg 
Arg 

65 70 75 

80 

Gly Val Ala Asp Tyr Ala Cys Ser Cys Ala Leu Gly Phe Ser Gly 
15 Pro 

85 90 95 

Leu Cys Leu Thr Pro Leu Asp Asn Ala Cys Leu Thr Asn Pro Cys 
Arg 

100 105 110 

20 Asn Gly Gly Thr Cys Asp Leu Leu Thr Leu Thr Glu Tyr Lys Cys 
Arg 

115 120 125 

Cys Pro Pro Gly Trp Ser Gly Lys Ser Cys Gin Gin Ala Asp Pro 
Cys 

25 130 135 140 

Ala Ser Asn Pro Cys Ala Asn Gly Gly Gin Cys Leu Pro Phe Glu 
Ala 

145 150 155 

160 

30 Ser Tyr lie Cys His Cys Pro Pro Ser Phe His Gly Pro Thr Cys 
Arg 

165 170 175 

Gin Asp Val Asn Glu Cys Gly Gin Lys Pro Arg Leu Cys Arg His 
Gly 

35 180 185 190 

Gly Thr Cys His Asn Glu Val Gly Ser Tyr Arg Cys Val Cys Arg 
Ala 

195 200 205 

Thr His Thr Gly Pro Asn Cys Glu Arg Pro Tyr Val Pro Cys Ser 
40 Pro 

210 215 220 

Ser Pro Cys Gin Asn Gly Gly Thr Cys Arg Pro Thr Gly Asp Val 
Thr 

225 230 235 

45 240 

His Glu Cys Ala Cys Leu Pro Gly Phe Thr Gly Gin Asn Cys Glu 
Glu 

245 250 255 

Asn lie Asp Asp Cys Pro Gly Asn Asn Cys Lys Asn Gly Gly Ala 
50 Cys 

260 265 270 
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Val Asp Gly 
Thr 

275 

Gly Gin Tyr 
5 Asn 

290 

Ala Cys Gin 
Asn 

305 
10 320 

Cys Val Cys 
He 

Asp Asp Cys 
15 Asp 

Arg Val Ala 
Leu 

355 

20 Leu Cys His 
Gly 

370 
Ser Asn Cys 
Cys 
25 385 
400 

Pro Ser Gly 
Cys 

30 Ser Leu Gly 
Thr 

Leu Gly Ser 
Arg 

35 435 
Cys Glu He 
Asp 

450 
Ala Thr Cys 
40 Pro 
465 
480 

Gly Tyr Glu 
Ser 

45 

Ser Pro Cys 
Phe 

Gin Cys Glu 
50 Asp 

515 



Val Asn Thr Tyr 

Cys Thr Glu Asp 
295 

Asn Gly Gly Thr 
310 

Val Asn Gly Trp 
325 

Ala Ser Ala Ala 
340 

Ser Phe Tyr Cys 

Leu Asn Asp Ala 
375 

Asp Thr Asn Pro 
390 

Tyr Thr Gly Pro 
405 

Ala Asn Pro Cys 
420 

Phe Glu Cys Gin 

Asp Val Asn Glu 
455 

Leu Asp Gin He 
470 

Gly Val His Cys 
485 

Leu His Asn Gly 
500 

Cys Pro Thr Gly 



Asn Cys Pro Cys 
280 

Val Asp Glu Cys 

Cys His Asn Thr 
315 

Thr Gly Glu Asp 
330 

Cys Phe His Gly 
345 

Glu Cys Pro His 
360 

Cys He Ser Asn 

Val Asn Gly Lys 
395 

Ala Cys Ser Gin 
410 

Glu His Ala Gly 
425 

Cys Leu Gin Gly 
440 

Cys Val Ser Asn 

Gly Glu Phe Gin 
475 

Glu Val Asn Thr 
490 

Arg Cys Leu Asp 
505 

Phe Thr Gly His 
520 
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Pro Pro Glu Trp 

285 

Gin Leu Met Pro 
300 

His Gly Gly Tyr 



Cys Ser Glu Asn 
335 

Ala Thr Cys His 
350 

Gly Arg Thr Gly 
365 

Pro Cys Asn Glu 
380 

Ala He Cys Thr 



Asp Val Asp Glu 
415 

Lys Cys He Asn 
430 

Tyr Thr Gly Pro 
445 

Pro Cys Gin Asn 
460 

Cys Met Cys Met 



Asp Glu Cys Ala 
495 

Lys lie Asn Glu 
510 

Leu Cys Gin Tyr 
525 
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Val Asp Glu Cys Ala Ser Thr Pro Cys Lys Asn Gly Ala Lys Cys 
Leu 

530 535 540 

Asp Gly Pro Asn Thr Tyr Thr Cys Val Cys Thr Glu Gly Tyr Thr 
5 Gly 

545 550 555 

560 

Thr His Cys Glu Val Asp He Asp Glu Cys Asp Pro Asp Pro Cys 
His 

10 565 570 575 

Tyr Gly Ser Cys Lys Asp Gly Val Ala Thr Phe Thr Cys Leu Cys 
Arg 

580 585 590 

Pro Gly Tyr Thr Gly His His Cys Glu Thr Asn He Asn Glu Cys 
15 Ser 

595 600 605 

Ser Gin Pro Cys Arg Leu Arg Gly Thr Cys Gin Asp Pro Asp Asn 
Ala 

610 615 620 

20 Tyr Leu Cys Phe Cys Leu Lys Gly Thr Thr Gly Pro Asn Cys Glu 
He 

625 630 635 

640 

Asn Leu Asp Asp Cys Ala Ser Ser Pro Cys Asp Ser Gly Thr Cys 
25 Leu 

645 650 655 

Asp Lys He Asp Gly Tyr Glu Cys Ala Cys Glu Pro Gly Tyr Thr 
Gly 

660 665 670 

30 Ser Met Cys Asn Ser Asn He Asp Glu Cys Ala Gly Asn Pro Cys 
His 

675 680 685 

Asn Gly Gly Thr Cys Glu Asp Gly He Asn Gly Phe Thr Cys Arg 
Cys 

35 690 695 700 

Pro Glu Gly Tyr His Asp Pro Thr Cys Leu Ser Glu Val Asn Glu 
Cys 

705 710 715 

720 

40 Asn Ser Asn Pro Cys Val His Gly Ala Cys Arg Asp Ser Leu Asn 
Gly 

725 730 735 

Tyr Lys Cys Asp Cys Asp Pro Gly Trp Ser Gly Thr Asn Cys Asp 
He 

45 740 745 750 

Asn Asn Asn Glu Cys Glu Ser Asn Pro Cys Val Asn Gly Gly Thr 
Cys 

755 760 765 

Lys Asp Met Thr Ser Gly He Val Cys Thr Cys Arg Glu Gly Phe 
50 Ser 

770 775 780 
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Gly Pro Asn Cys Gin Thr Asn lie Asn Glu Cys Ala Ser Asn Pro 
Cys 

785 790 795 

800 

5 Leu Asn Lys Gly Thr Cys He Asp Asp Val Ala Gly Tyr Lys Cys 
Asn 

805 810 815 

Cys Leu Leu Pro Tyr Thr Gly Ala Thr Cys Glu Val Val Leu Ala 
Pro 

10 820 825 830 

Cys Ala Pro Ser Pro Cys Arg Asn Gly Gly Glu Cys Arg Gin Ser 
Glu 

835 840 845 

Asp Tyr Glu Ser Phe Ser Cys Val Cys Pro Thr Ala Gly Ala Lys 
15 Gly 

850 855 860 

Gin Thr Cys Glu Val Asp lie Asn Glu Cys Val Leu Ser Pro Cys 
Arg 

865 870 875 

20 880 

His Gly Ala Ser Cys Gin Asn Thr His Gly Gly Tyr Arg Cys His 
Cys 

885 890 895 

Gin Ala Gly Tyr Ser Gly Arg Asn Cys Glu Thr Asp He Asp Asp 
25 Cys 

900 905 910 

Arg Pro Asn Pro Cys His Asn Gly Gly Ser Cys Thr Asp Gly He 
Asn 

915 920 925 

30 Thr Ala Phe Cys Asp Cys Leu Pro Gly Phe Arg Gly Thr Phe Cys 
Glu 

930 935 940 

Glu Asp He Asn Glu Cys Ala Ser Asp Pro Cys Arg Asn Gly Ala 
Asn 

35 945 950 955 

960 

Cys Thr Asp Cys Val Asp Ser Tyr Thr Cys Thr Cys Pro Ala Gly 
Phe 

965 970 975 

40 Ser Gly He His Cys Glu Asn Asn Thr Pro Asp Cys Thr Glu Ser 
Ser 

980 985 990 

Cys Phe Asn Gly Gly Thr Cys Val Asp Gly He Asn Ser Phe Thr 
Cys 

45 995 1000 1005 

Leu Cys Pro Pro Gly Phe Thr Gly Ser Tyr Cys Gin His Val Val 
Asn 

1010 1015 1020 

Glu Cys Asp Ser Arg Pro Cys Leu Leu Gly Gly Thr Cys Gin Asp 
50 Gly 

1025 1030 1035 

1040 
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Arg Gly Leu His Arg Cys Thr Cys Pro Gin Gly Tyr Thr Gly Pro 
Asn 

1045 1050 1055 

Cys Gin Asn Leu Val His Trp Cys Asp Ser Ser Pro Cys Lys Asn 
5 Gly 

1060 1065 1070 

Gly Lys Cys Trp Gin Thr His Thr Gin Tyr Arg Cys Glu Cys Pro 
Ser 

1075 1080 1085 

10 Gly Trp Thr Gly Leu Tyr Cys Asp Val Pro Ser Val Ser Cys Glu 
Val 

1090 1095 1100 

Ala Ala Gin Arg Gin Gly Val Asp Val Ala Arg Leu Cys Gin His 
Gly 

15 1105 1110 1115 

1120 

Gly Leu Cys Val Asp Ala Gly Asn Thr His His Cys Arg Cys Gin 
Ala 

1125 1130 1135 

20 Gly Tyr Thr Gly Ser Tyr Cys Glu Asp Leu Val Asp Glu Cys Ser 
Pro 

1140 1145 1150 

Ser Pro Cys Gin Asn Gly Ala Thr Cys Thr Asp Tyr Leu Gly Gly 
Tyr 

25 1155 1160 1165 

Ser Cys Lys Cys Val Ala Gly Tyr His Gly Val Asn Cys Ser Glu 
Glu 

1170 1175 1180 

lie Asp Glu Cys Leu Ser His Pro Cys Gin Asn Gly Gly Thr Cys 
30 Leu 

1185 1190 1195 

1200 

Asp Leu Pro Asn Thr Tyr Lys Cys Ser Cys Pro Arg Gly Thr Gin 
Gly 

35 1205 1210 1215 

Val His Cys Glu lie Asn Val Asp Asp Cys Asn Pro Pro Val Asp 
Pro 

1220 1225 1230 

Val Ser Arg Ser Pro Lys Cys Phe Asn Asn Gly Thr Cys Val Asp 
40 Gin 

1235 1240 1245 

Val Gly Gly Tyr Ser Cys Thr Cys Pro Pro Gly Phe Val Gly Glu 
Arg 

1250 1255 1260 

45 Cys Glu Gly Asp Val Asn Glu Cys Leu Ser Asn Pro Cys Asp Ala 
Arg 

1265 1270 1275 

1280 

Gly Thr Gin Asn Cys Val Gin Arg Val Asn Asp Phe His Cys Glu 
50 Cys 

1285 1290 1295 
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Arg Ala Gly His Thr Gly Arg Arg Cys Glu Ser Val lie Asn Gly 
Cys 

1300 1305 1310 

Lys Gly Lys Pro Cys Lys Asn Gly Gly Thr Cys Ala Val Ala Ser 
5 Asn 

1315 1320 1325 

Thr Ala Arg Gly Phe He Cys Lys Cys Pro Ala Gly Phe Glu Gly 
Ala 

1330 1335 1340 

10 Thr Cys Glu Asn Asp Ala Arg Thr Cys Gly Ser Leu Arg Cys Leu 
Asn 

1345 1350 1355 

1360 

Gly Gly Thr Cys He Ser Gly Pro Arg Ser Pro Thr Cys Leu Cys 
15 Leu 

1365 1370 1375 

Gly Pro Phe Thr Gly Pro Glu Cys Gin Phe Pro Ala Ser Ser Pro 
Cys 

1380 1385 1390 

20 Leu Gly Gly Asn Pro Cys Tyr Asn Gin Gly Thr Cys Glu Pro Thr 
Ser 

1395 1400 1405 

Glu Ser Pro Phe Tyr Arg Cys Leu Cys Pro Ala Lys Phe Asn Gly 
Leu 

25 1410 1415 1420 

Leu Cys His He Leu Asp Tyr Ser Phe Gly Gly Gly Ala Gly Arg 
Asp 

1425 1430 1435 

1440 

30 He Pro Pro Pro Leu He Glu Glu Ala Cys Glu Leu Pro Glu Cys 
Gin 

1445 1450 1455 

Glu Asp Ala Gly Asn Lys Val Cys Ser Leu Gin Cys Asn Asn His 
Ala 

35 1460 1465 1470 

Cys Gly Trp Asp Gly Gly Asp Cys Ser Leu Asn Phe Asn Asp Pro 
Trp 

1475 1480 1485 

Lys Asn Cys Thr Gin Ser Leu Gin Cys Trp Lys Tyr Phe Ser Asp 
40 Gly 

1490 1495 1500 

His Cys Asp Ser Gin Cys Asn Ser Ala Gly Cys Leu Phe Asp Gly 
Phe 

1505 1510 1515 

45 1520 

Asp Cys Gin Arg Ala Glu Gly Gin Cys Asn Pro Leu Tyr Asp Gin 
Tyr 

1525 1530 1535 

Cys Lys Asp His Phe Ser Asp Gly His Cys Asp Gin Gly Cys Asn 
50 Ser 

1540 1545 1550 
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Ala Glu Cys Glu Trp Asp Gly Leu Asp Cys Ala Glu His Val Pro 
Glu 

1555 1560 1565 

Arg Leu Ala Ala Gly Thr Leu Val Val Val Val Leu Met Pro Pro 
5 Glu 

1570 1575 1580 

Gin Leu Arg Asn Ser Ser Phe His Phe Leu Arg Glu Leu Ser Arg 
Val 

1585 1590 1595 

10 1600 

Leu His Thr Asn Val Val Phe Lys Arg Asp Ala His Gly Gin Gin 
Met 

1605 1610 1615 

lie Phe Pro Tyr Tyr Gly Arg Glu Glu Glu Leu Arg Lys His Pro 
15 He 

1620 1625 1630 

Lys Arg Ala Ala Glu Gly Trp Ala Ala Pro Asp Ala Leu Leu Gly 
Gin 

1635 1640 1645 

20 Val Lys Ala Ser Leu Leu Pro Gly Gly Ser Glu Gly Gly Arg Arg 
Arg 

1650 1655 1660 

Arg Glu Leu Asp Pro Met Asp Val Arg Gly Ser He Val Tyr Leu 
Glu 

25 1665 1670 1675 

1680 

He Asp Asn Arg Gin Cys Val Gin Ala Ser Ser Gin Cys Phe Gin 
Ser 

1685 1690 1695 

30 Ala Thr Asp Val Ala Ala Phe Leu Gly Ala Leu Ala Ser Leu Gly 
Ser 

1700 1705 1710 

Leu Asn He Pro Tyr Lys He Glu Ala Val Gin Ser Glu Thr Val 
Glu 

35 1715 1720 1725 

Pro Pro Pro Pro Ala Gin Leu His Phe Met Tyr Val Ala Ala Ala 
Ala 

1730 1735 1740 

Phe Val Leu Leu Phe Phe Val Gly Cys Gly Val Leu Leu Ser Arg 
40 Lys 

1745 1750 1755 

1760 

Arg Arg Arg Gin His Gly Gin Leu Trp Phe Pro Glu Gly Phe Lys 
Val 

45 1765 1770 1775 

Ser Glu Ala Ser Lys Lys Lys Arg Arg Glu Pro Leu Gly Glu Asp 
Ser 

1780 1785 * 1790 

Val Gly Leu Lys Pro Leu Lys Asn Ala Ser Asp Gly Ala Leu Met 
50 Asp 

1795 1800 1805 
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Asp Asn Gin Asn Glu Trp Gly Asp Glu Asp Leu Glu Thr Lys Lys 
Phe 

1810 1815 1820 

Arg Phe Glu Glu Pro Val Val Leu Pro Asp Leu Asp Asp Gin Thr 
5 Asp 

1825 1830 1835 

1840 

His Arg Gin Trp Thr Gin Gin His Leu Asp Ala Ala Asp Leu Arg 
Met 

10 1845 1850 1855 

Ser Ala Met Ala Pro Thr Pro Pro Gin Gly Glu Val Asp Ala Asp 
Cys 

1860 1865 1870 

Met Asp Val Asn Val Arg Gly Pro Asp Gly Phe Thr Pro Leu Met 
15 He 

1875 1880 1885 

Ala Ser Cys Ser Gly Gly Gly Leu Glu Thr Gly Asn Ser Glu Glu 
Glu 

1890 1895 1900 

20 Glu Asp Ala Pro Ala Val He Ser Asp Phe He Tyr Gin Gly Ala 
Ser 

1905 1910 1915 

1920 

Leu His Asn Gin Thr Asp Arg Thr Gly Glu Thr Ala Leu His Leu 
25 Ala 

1925 1930 1935 

Ala Arg Tyr Ser Arg Ser Asp Ala Ala Lys Arg Leu Leu Glu Ala 
Ser 

1940 1945 1950 

30 Ala Asp Ala Asn He Gin Asp Asn Met Gly Arg Thr Pro Leu His 
Ala 

1955 1960 1965 

Ala Val Ser Ala Asp Ala Gin Gly Val Phe Gin He Leu He Arg 
Asn 

35 1970 1975 1980 

Arg Ala Thr Asp Leu Asp Ala Arg Met His Asp Gly Thr Thr Pro 
Leu 

1985 1990 1995 

2000 

40 He Leu Ala Ala Arg Leu Ala Val Glu Gly Met Leu Glu Asp Leu 
He 

2005 2010 2015 

Asn Ser His Ala Asp Val Asn Ala Val Asp Asp Leu Gly Lys Ser 
Ala 

45 2020 2025 2030 

Leu His Trp Ala Ala Ala Val Asn Asn Val Asp Ala Ala Val Val 
Leu 

2035 2040 2045 

Leu Lys Asn Gly Ala Asn Lys Asp Met Gin Asn Asn Arg Glu Glu 
50 Thr 

2050 2055 2060 
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Pro Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Thr Ala Lys 
Val 

2065 2070 2075 

2080 

5 Leu Leu Asp His Phe Ala Asn Arg Asp lie Thr Asp His Met Asp 
Arg 

2085 2090 2095 

Leu Pro Arg Asp He Ala Gin Glu Arg Met His His Asp He Val 
Arg 

10 2100 2105 2110 

Leu Leu Asp Glu Tyr Asn Leu Val Arg Ser Pro Gin Leu His Gly 
Ala 

2115 2120 2125 

Pro Leu Gly Gly Thr Pro Thr Leu Ser Pro Pro Leu Cys Ser Pro 
15 Asn 

2130 2135 2140 

Gly Tyr Leu Gly Ser Leu Lys Pro Gly Val Gin Gly Lys Lys Val 
Arg 

2145 2150 2155 

20 2160 

Lys Pro Ser Ser Lys Gly Leu Ala Cys Gly Ser Lys Glu Ala Lys 
Asp 

2165 2170 2175 

Leu Lys Ala Arg Arg Lys Lys Ser Gin Asp Gly Lys Gly Cys Leu 
25 Leu 

2180 2185 2190 

Asp Ser Ser Gly Met Leu Ser Pro Val Asp Ser Leu Glu Ser Pro 
His 

2195 2200 2205 

30 Gly Tyr Leu Ser Asp Val Ala Ser Pro Pro Leu Leu Pro Ser Pro 
Phe 

2210 2215 2220 

Gin Gin Ser Pro Ser Val Pro Leu Asn His Leu Pro Gly Met Pro 
Asp 

35 2225 2230 2235 

2240 

Thr His Leu Gly He Gly His Leu Asn Val Ala Ala Lys Pro Glu 
Met 

2245 2250 2255 

40 Ala Ala Leu Gly Gly Gly Gly Arg Leu Ala Phe Glu Thr Gly Pro 
Pro 

2260 2265 2270 

Arg Leu Ser His Leu Pro Val Ala Ser Gly Thr Ser Thr Val Leu 
Gly 

45 2275 2280 2285 

Ser Ser Ser Gly Gly Ala Leu Asn Phe Thr Val Gly Gly Ser Thr 
Ser 

2290 2295 2300 

Leu Asn Gly Gin Cys Glu Trp Leu Ser Arg Leu Gin Ser Gly Met 
50 Val 

2305 2310 2315 

2320 
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Pro Asn Gin Tyr Asn Pro Leu Arg Gly Ser Val Ala Pro Gly Pro 
Leu 

2325 2330 2335 

Ser Thr Gin Ala Pro Ser Leu Gin His Gly Met Val Gly Pro Leu 
5 His 

2340 2345 2350 

Ser Ser Leu Ala Ala Ser Ala Leu Ser Gin Met Met Ser Tyr Gin 
Gly 

2355 2360 2365 

10 Leu Pro Ser Thr Arg Leu Ala Thr Gin Pro His Leu Val Gin Thr 
Gin 

2370 2375 2380 

Gin Val Gin Pro Gin Asn Leu Gin Met Gin Gin Gin Asn Leu Gin 
Pro 

15 2385 2390 2395 

2400 

Ala Asn lie Gin Gin Gin Gin Ser Leu Gin Pro Pro Pro Pro Pro 
Pro 

2405 2410 2415 

20 Gin Pro His Leu Gly Val Ser Ser Ala Ala Ser Gly His Leu Gly 
Arg 

2420 2425 2430 

Ser Phe Leu Ser Gly Glu Pro Ser Gin Ala Asp Val Gin Pro Leu 
Gly 

25 2435 2440 2445 

Pro Ser Ser Leu Ala Val His Thr lie Leu Pro Gin Glu Ser Pro 
Ala 

2450 2455 2460 

Leu Pro Thr Ser Leu Pro Ser Ser Leu Val Pro Pro Val Thr Ala 
30 Ala 

2465 2470 2475 

2480 

Gin Phe Leu Thr Pro Pro Ser Gin His Ser Tyr Ser Ser Pro Val 
Asp 

35 2485 2490 2495 

Asn Thr Pro Ser His Gin Leu Gin Val Pro Glu His Pro Phe Leu 
Thr 

2500 2505 2510 

Pro Ser Pro Glu Ser Pro Asp Gin Trp Ser Ser Ser Ser Pro His 
40 Ser 

2515 2520 2525 

Asn Val Ser Asp Trp Ser Glu Gly Val Ser Ser Pro Pro Thr Ser 
Met 

2530 2535 2540 

45 Gin Ser Gin lie Ala Arg lie Pro Glu Ala Phe Lys 
2545 2550 2555 



50 <210> 13 

<211> 657 
<212> PRT 
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<400> 13 

5 Met His Ala Leu Gly He Ser Ala Ser Thr Trp Thr Thr Thr Thr 
Ala 

15 10 15 

Arg Pro Leu Pro Leu Pro Leu Pro Pro Ala Ala Pro Ala Ala Leu 
Ser 

10 20 25 30 

He His Tyr Lys Ser Met Asp He Val Leu Thr Val Thr Met Val 
His 

35 40 45 

Gly Lys Glu Glu Gly Leu He Leu Phe Asp Pro He Pro Val Ser 
15 Ser 

50 55 60 

Gly Phe Ser Lys Asn Gly Val Leu Val Ser Val Leu Gly Thr Thr 
Thr 

65 70 75 

20 80 

Met Ala Val Asp He Pro Ala Leu Gly Val Ser Val Thr Phe Asn 
Gly 

85 90 95 

Gin Val Phe Gin Ala Arg Leu Pro Tyr Ser Leu Phe His Asn Asn 
25 Thr 

100 105 110 

Glu Gly Gin Cys Gly Thr Cys Thr Asn Asn Gin Arg Asp Asp Cys 
Leu 

115 120 125 

30 Gin Arg Asp Gly Thr Thr Ala Ala Ser Cys Lys Asp Met Ala Lys 
Thr 

130 135 140 

Trp Leu Val Pro Asp Ser Arg Lys Asp Gly Cys Trp Ala Pro Thr 
Gly 

35 145 150 155 

160 

Thr Pro Pro Thr Ala Ser Pro Ala Ala Pro Val Ser Ser Thr Pro 
Thr 

165 170 175 

40 Pro Thr Pro Cys Pro Pro Gin Leu Leu Cys Asp Leu Met Leu Ser 
Gin 

180 185 190 

Val Phe Ala Glu Cys His Thr Leu Leu Pro Pro Gly Pro Phe Phe 
Asn 

45 195 200 205 

Ala Cys He Ser Asp His Cys Arg Gly Arg Leu Glu Val Pro Cys 
Gin 

210 215 220 

Ser Leu Glu Ala Tyr Ala Glu Leu Cys Arg Ala Arg Gly Val Cys 
50 Ser 

225 230 235 

240 
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Asp Trp Arg Gly Ala Thr Gly Gly Leu Cys Asp Leu Thr Cys Pro 
Pro 

245 250 255 

Thr Lys Val Tyr Lys Pro Cys Gly Pro He Gin Pro Ala Thr Cys 
5 Asn 

260 265 270 

Ser Arg Asn Gin Ser Pro Gin Leu Glu Gly Met Ala Glu Gly Cys 
Phe 

275 280 285 

10 Cys Pro Glu Asp Gin He Leu Phe Asn Ala His Met Gly He Cys 
Val 

290 295 300 

Gin Ala Cys Pro Cys Val Gly Pro Asp Gly Phe Pro Lys Phe Pro 
Gly 

15 305 310 315 

320 

Glu Arg Trp Val Ser Asn Cys Gin Ser Cys Val Cys Asp Glu Gly 
Ser 

325 330 335 

20 Val Ser Val Gin Cys Lys Pro Leu Pro Cys Asp Ala Gin Gly Gin 
Pro 

340 345 350 

Pro Pro Cys Asn Arg Pro Gly Phe Val Thr Val Thr Arg Pro Arg 
Ala 

25 355 360 365 

Glu Asn Pro Cys Cys Pro Glu Thr Val Cys Val Cys Asn Thr Thr 
Thr 

370 375 380 

Cys Pro Gin Ser Leu Pro Val Cys Pro Pro Gly Gin Glu Ser He 
30 Cys 

385 390 395 

400 

Thr Gin Glu Glu Gly Asp Cys Cys Pro Thr Phe Arg Cys Arg Pro 
Gin 

35 405 410 415 

Leu Cys Ser Tyr Asn Gly Thr Phe Tyr Gly Val Gly Ala Thr Phe 
Pro 

420 425 430 

Gly Ala Leu Pro Cys His Met Cys Thr Cys Leu Ser Gly Asp Thr 
40 Gin 

435 440 445 

Asp Pro Thr Val Gin Cys Gin Glu Asp Ala Cys Asn Asn Thr Thr 
Cys 

450 455 460 

45 Pro Gin Gly Phe Glu Tyr Lys Arg Val Ala Gly Gin Cys Cys Gly 
Glu 

465 470 475 

480 

Cys Val Gin Thr Ala Cys Leu Thr Pro Asp Gly Gin Pro Val Gin 
50 Leu 

485 490 495 
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Asn 


Glu 


Thr 


Trp 


Val 


Asn Ser 


His 


Val 


Asp 


Asn 


Cys 


Thr 


Val 


Tyr 




Leu 






500 








505 










510 






Cys 


Glu 


Ala 


Glu Gly 


Gly Val 


His 


Leu 


Leu 


Thr 


Pro 


Gin 


Pro 


Ala 


5 


Ser 




515 








520 










525 








Cys 


Pro 


Asp 


Val 


Ser 


Ser Cys Arg Gly Ser Leu Arg Lys Thr Gly 




Cys 


530 








535 










540 








10 


Cys 
Thr 

545 
560 


Tyr 


Ser 


Cys 


Glu 


Glu Asp 
550 


Ser 


Cys 


Gin 


Val 
555 


Arg 


He 


Asn 


Thr 




He 


Leu 


Trp 


His 


Gin 


Gly Cys 


Glu 


Thr 


Glu 


Val 


Asn 


He 


Thr 


Phe 


15 


Cys 








565 








570 










575 




Glu 


Gly 


Ser 


Cys 


Pro 


Gly Ala 


Ser 


Lys 


Tyr 


Ser 


Ala 


Glu 


Ala 


Gin 




Ala 






580 








585 










590 




20 


Met 
Glu 


Gin 


His 
595 


Gin 


Cys 


Thr Cys 


Cys 
600 


Gin 


Glu 


Arg 


Arg 


Val 
605 


His 


Glu 




Thr 


Val 


Pro 


Leu 


His 


Cys Pro Asn Gly Ser Ala 


He 


Leu 


His 


Thr 




Tyr 




























25 




610 








615 










620 










Thr 


His 


Val 


Asp 


Glu 


Cys Gly Cys 


Thr 


Pro 


Phe 


Cys 


Val 


Pro 


Ala 




Pro 






























625 










630 








635 












640 




























30 


Met 


Ala 


Pro 


Pro 


His 


Thr Arg Gly Phe 


Pro 


Ala 


Gin 


Glu 


Ala 


Thr 




Ala 








645 








650 










655 




Val 





























35 



<210> 14 
<211> 184 
40 <212> PRT 

<213> Homo sapiens 



<400> 14 

45 Met Ala Thr Ala Leu Ala Leu Arg Ser Leu Tyr Arg Ala Arg Pro 
Ser 

15 10 15 

Leu Arg Cys Pro Pro Val Glu Leu Pro Trp Ala Pro Arg Arg Gly 
His 

50 20 25 30 

Arg Leu Ser Pro Ala Asp Asp Glu Leu Tyr Gin Arg Thr Arg He 
Ser 
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35 40 45 

Leu Leu Gin Arg Glu Ala Ala Gin Ala Met Tyr He Asp Ser Tyr 
Asn 

50 55 60 

5 Ser Arg Gly Phe Met He Asn Gly Asn Arg Val Leu Gly Pro Cys 
Ala 

65 70 75 

80 

Leu Leu Pro His Ser Val Val Gin Trp Asn Val Gly Ser His Gin 
10 Asp 

85 90 95 

He Thr Glu Asp Ser Phe Ser Leu Phe Trp Leu Leu Glu Pro Arg 
He 

100 105 110 

15 Glu He Val Val Val Gly Thr Gly Asp Arg Thr Glu Arg Leu Gin 
Ser 

115 120 125 

Gin Val Leu Gin Ala Met Arg Gin Arg Gly He Ala Val Glu Val 
Gin 

20 130 135 140 

Asp Thr Pro Asn Ala Cys Ala Thr Phe Asn Phe Leu Cys His Glu 
Gly 

145 150 155 

160 

25 Arg Val Thr Gly Ala Ala Leu He Pro Pro Pro Gly Gly Thr Ser 
Leu 

165 170 175 

Thr Ser Leu Gly Gin Ala Ala Gin 
180 

30 



<210> 15 
<211> 122 
35 <212> PRT 

<213> Homo sapiens 



<400> 15 

40 Met Pro Gly His Met Pro Arg Gly Gly Trp Cys Leu Phe Thr Gly 
Pro 

15 10 15 

Ala Trp Val Gly Ser Ser His Gly Glu Gly Lys Arg Arg Gly Gly 
Ser 

45 20 25 30 

Leu Ala Gin Leu Gly Lys Gin Ser Gin Trp Glu His Lys Pro Val 
Leu 

35 40 45 

Leu His Gly Gin Gly Trp Lys Lys Glu Cys Val Lys Pro Ser Pro 
50 Met 

50 55 60 
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Pro Leu Phe Pro Ser Val Pro Gin Met Ser Tyr Ala Ser Thr Thr 
Pro 

65 70 75 

80 

5 He Ala Leu Ser Thr Pro Arg Arg Ser Ala Ser Gin Trp Arg Ser 
Cys 

85 90 95 

Ser Leu Thr Arg Ala Cys Arg Asn Met He Arg Ser Ser Pro Leu 
Ser 

10 100 105 110 

Arg Trp Leu Pro His Phe Ser Glu Thr Ser 
115 120 



15 

<210> 16 
<211> 527 
<212> PRT 

<213> Homo sapiens 

20 

<400> 16 

Met Ala Gin Thr Leu Gin Met Glu He Pro Asn Phe Gly Asn Ser 
He 

25 1 5 10 15 

Leu Glu Cys Leu Asn Glu Gin Arg Leu Gin Gly Leu Tyr Cys Asp 
Val 

20 25 30 

Ser Val Val Val Lys Gly His Ala Phe Lys Ala His Arg Ala Val 
30 Leu 

35 40 45 

Ala Ala Ser Ser Ser Tyr Phe Arg Asp Leu Phe Asn Asn Ser Arg 
Ser 

50 55 60 

35 Ala Val Val Glu Leu Pro Ala Ala Val Gin Pro Gin Ser Phe Gin 
Gin 

65 70 75 

80 

He Leu Ser Phe Cys Tyr Thr Gly Arg Leu Ser Met Asn Val Gly 
40 Asp 

85 90 95 

Gin Phe Leu Leu Met Tyr Thr Ala Gly Phe Leu Gin He Gin Glu 
lie 

100 105 110 

45 Met Glu Lys Gly Thr Glu Phe Phe Leu Lys Val Ser Ser Pro Ser 
Cys 

115 120 125 

Asp Ser Gin Gly Leu His Ala Glu Glu Ala Pro Ser Ser Glu Pro 
Gin 

50 130 135 140 

Ser Pro Val Ala Gin Thr Ser Gly Trp Pro Ala Cys Ser Thr Pro 
Leu 
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145 150 155 

160 

Pro Leu Val Ser Arg Val Lys Thr Glu Gin Gin Glu Ser Asp Ser 
Val 

5 165 170 175 

Gin Cys Met Pro Val Ala Lys Arg Leu Trp Asp Ser Gly Gin Lys 
Glu 

180 185 190 

Ala Gly Gly Gly Gly Asn Gly Ser Arg Lys Met Ala Lys Phe Ser 
10 Thr 

195 200 205 

Pro Asp Leu Ala Ala Asn Arg Pro His Gin Pro Pro Pro Pro Gin 
Gin 

210 215 220 

15 Ala Pro Val Val Ala Ala Ala Gin Pro Ala Val Ala Ala Gly Ala 
Gly 

225 230 235 

240 

Gin Pro Ala Gly Gly Val Ala Ala Ala Gly Gly Val Val Ser Gly 
20 Pro 

245 250 255 

Ser Thr Ser Glu Arg Thr Ser Pro Gly Thr Ser Ser Ala Tyr Thr 
Ser 

260 265 270 

25 Asp Ser Pro Gly Ser Tyr His Asn Glu Glu Asp Glu Glu Glu Asp 
Gly 

275 280 285 

Gly Glu Glu Gly Met Asp Glu Gin Tyr Arg Gin lie Cys Asn Met 
Tyr 

30 290 295 300 

Thr Met Tyr Ser Met Met Asn Val Gly Gin Thr Ala Glu Lys Val 
Glu 

305 310 315 

320 

35 Ala Leu Pro Glu Gin Val Ala Pro Glu Ser Arg Asn Arg lie Arg 
Val 

325 330 335 

Arg Gin Asp Leu Ala Ser Leu Pro Ala Glu Leu lie Asn Gin lie 
Gly 

40 340 345 350 

Asn Arg Cys His Pro Lys Leu Tyr Asp Glu Gly Asp Pro Ser Glu 
Lys 

355 360 365 

Leu Glu Leu Val Thr Gly Thr Asn Val Tyr lie Thr Arg Ala Gin 
45 Leu 

370 375 380 

Met Asn Cys His Val Ser Ala Gly Thr Arg His Lys Val Leu Leu 
Arg 

385 390 395 

50 400 

Arg Leu Leu Ala Ser Phe Phe Asp Arg Asn Thr Leu Ala Asn Ser 
Cys 
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405 410 415 

Gly Thr Gly He Arg Ser Ser Thr Asn Asp Pro Arg Arg Lys Pro 
Leu 

420 425 430 

5 Asp Ser Arg Val Leu His Ala Val Lys Tyr Tyr Cys Gin Asn Phe 
Ala 

435 440 445 

Pro Asn Phe Lys Glu Ser Glu Met Asn Ala He Ala Ala Asp Met 
Cys 

10 450 455 460 

Thr Asn Ala Arg Arg Val Val Arg Lys Ser Trp Met Pro Lys Val 
Lys 

465 470 475 

480 

15 Val Leu Lys Ala Glu Asp Asp Ala Tyr Thr Thr Phe He Ser Glu 
Thr 

485 490 495 

Gly Lys He Glu Pro Asp Met Met Gly Val Glu His Gly Phe Glu 
Thr 

20 500 505 510 

Ala Ser His Glu Gly Glu Ala Gly Pro Ser Ala Glu Ala Leu Gin 
515 520 525 



25 



30 



<210> 17 
<211> 576 
<212> PRT 

<213> Homo sapiens 



<400> 17 

Met Leu Ala Met Gly Ala Leu Ala Gly Phe Trp He Leu Cys Leu 
Leu 

35 1 5 10 15 

Thr Tyr Gly Tyr Leu Ser Trp Gly Gin Ala Leu Glu Glu Glu Glu 
Glu 

20 25 30 

Gly Ala Leu Leu Ala Gin Ala Gly Glu Lys Leu Glu Pro Ser Thr 
40 Thr 

35 40 45 

Ser Thr Ser Gin Pro His Leu He Phe He Leu Ala Asp Asp Gin 
Gly 

50 55 60 

45 Phe Arg Asp Val Gly Tyr His Gly Ser Glu He Lys Thr Pro Thr 
Leu 

65 70 75 

80 

Asp Lys Leu Ala Ala Glu Gly Val Lys Leu Glu Asn Tyr Tyr Val 
50 Gin 

85 90 95 
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Pro 


He 


Cys Thr 


Pro 


Ser 


Arg 


Ser 


Gin 


Phe 


He 


Thr 


Gly Lys Tyr 




Gin 




100 










105 










110 




He 


His 


Thr Gly 


Leu 


Gin 


His 


Ser 


He 


lie 


Arg 


Pro 


Thr 


Gin Pro 


5 


Asn 




115 








120 










125 






Cys 


Leu 


Pro Leu 


Asp Asn Ala 


Thr 


Leu 


Pro 


Gin 


Lys 


Leu 


Lys Glu 




Val 


130 








135 










140 






10 


Gly 


Tyr 


Ser Thr 


His 


Met 


Val 


Gly Lys 


Trp 


His 


Leu 


Gly Phe Tyr 




Arg 




























145 








150 










155 










160 




























Lys 


Glu 


Cys Met 


Pro Thr Arg Arg Gly 


Phe 


Asp 


Thr 


Phe 


Phe Gly 


15 


Ser 






165 










170 








175 




Leu 


Leu Gly Ser 


Gly Asp Tyr Tyr 


Thr 


His 


Tyr 


Lys 


Cys 


Asp Ser 




Pro 




180 










185 










190 


20 


Gly 


Met 


Cys Gly 


Tyr 


Asp 


Leu 


Tyr 


Glu 


Asn Asp 


Asn 


Ala 


Ala Trp 




Asp 




195 








200 










205 






Tyr 


Asp Asn Gly 


He 


Tyr 


Ser 


Thr 


Gin 


Met 


Tyr 


Thr 


Gin Arg Val 




Gin 


























25 




210 








215 










220 








Gin 


He 


Leu Ala 


Ser 


His 


Asn 


Pro 


Thr 


Lys 


Pro 


lie 


Phe 


Leu Tyr 




He 




























225 








230 










235 










240 


























30 


Ala 


Tyr 


Gin Ala 


Val 


His 


Ser 


Pro 


Leu 


Gin 


Ala 


Pro 


Gly Arg Tyr 




Phe 






245 










250 








255 




Glu 


His 


Tyr Arg 


Ser 


lie 


lie 


Asn 


lie 


Asn Arg 


Arg 


Arg 


Tyr Ala 




Ala 


























35 






260 










265 










270 




Met 


Leu 


Ser Cys 


Leu Asp 


Glu 


Ala 


He 


Asn 


Asn 


Val 


Thr 


Leu Ala 




Leu 




275 








280 










285 






Lys 


Thr Tyr Gly 


Phe 


Tyr Asn Asn 


Ser 


lie 


He 


lie 


Tyr 


Ser Ser 


40 


Asp 


290 








295 










300 








Asn 


Gly Gly Gin 


Pro Thr Ala Gly Gly Ser Asn Trp 


Pro 


Leu Arg 




Gly 




























305 








310 










315 








45 


320 




























Ser 


Lys 


Gly Thr 


Tyr Trp Glu Gly Gly 


lie 


Arg 


Ala 


Val 


Gly Phe 




Val 






325 










330 








335 




His 


Ser 


Pro Leu 


Leu 


Lys 


Asn 


Lys 


Gly Thr Val 


Cys 


Lys 


Glu Leu 


50 


Val 




340 










345 










350 
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His lie Thr Asp Trp Tyr Pro Thr Leu lie Ser Leu Ala Glu Gly 
Gin 

355 360 365 

He Asp Glu Asp He Gin Leu Asp Gly Tyr Asp He Trp Glu Thr 
5 He 

370 375 380 

Ser Glu Gly Leu Arg Ser Pro Arg Val Asp He Leu His Asn He 
Asp 

385 390 395 

10 400 

Pro He Tyr Thr Lys Ala Lys Asn Gly Ser Trp Ala Ala Gly Tyr 
Gly 

405 410 415 

lie Trp Asn Thr Ala He Gin Ser Ala He Arg Val Gin His Trp 
15 Lys 

420 425 430 

Leu Leu Thr Gly Asn Pro Gly Tyr Ser Asp Trp Val Pro Pro Gin 
Ser 

435 440 445 

20 Phe Ser Asn Leu Gly Pro Asn Arg Trp His Asn Glu Arg He Thr 
Leu 

450 455 460 

Ser Thr Gly Lys Ser Val Trp Leu Phe Asn He Thr Ala Asp Pro 
Tyr 

25 465 470 475 

480 

Glu Arg Val Asp Leu Ser Asn Arg Tyr Pro Gly lie Val Lys Lys 
Leu 

485 490 495 

30 Leu Arg Arg Leu Ser Gin Phe Asn Lys Thr Ala Val Pro Val Arg 
Tyr 

500 505 510 

Pro Pro Lys Asp Pro Arg Ser Asn Pro Arg Leu Asn Gly Gly Val 
Trp 

35 515 520 525 

Gly Pro Trp Tyr Lys Glu Glu Thr Lys Lys Lys Lys Pro Ser Lys 
Asn 

530 535 540 

Gin Ala Glu Lys Lys Gin Lys Lys Ser Lys Lys Lys Lys Lys Lys 
40 Gin 

545 550 555 

560 

Gin Lys Ala Val Ser Gly Ser Thr Cys His Ser Gly Val Thr Cys 
Gly 

45 565 570 575 



50 

<210> 18 
<211> 336 
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<212> PRT 

<213> Homo sapiens 



5 <400> 18 

Met Glu Pro Gin Lys He Met Pro Pro Ser Lys Pro His Pro Pro 
Val 

15 10 15 

Val Gly Lys Val Thr His His Ser He Glu Leu Tyr Trp Asp Leu 
10 Glu 

20 25 30 

Lys Lys Ala Lys Arg Gin Gly Pro Gin Glu Gin Trp Phe Arg Phe 
Ser 

35 40 45 

15 He Glu Glu Glu Asp Pro Lys Met His Thr Tyr Gly He He Tyr 
Thr 

50 55 60 

Gly Tyr Ala Thr Lys His Val Val Glu Gly Leu Glu Pro Arg Thr 
Leu 

20 65 70 75 

80 

Tyr Arg Phe Arg Leu Lys Val Thr Ser Pro Ser Gly Glu Cys Glu 
Tyr 

85 90 95 

25 Ser Pro Leu Val Ser Val Ser Thr Thr Arg Glu Pro He Ser Ser 
Glu 

100 105 110 

His Leu His Arg Ala Val Ser Val Asn Asp Glu Asp Leu Leu Val 
Arg 

30 115 120 125 

He Leu Gin Gly Gly Arg Val Lys Val Asp Val Pro Asn Lys Phe 
Gly 

130 135 140 

Phe Thr Ala Leu Met Val Ala Ala Gin Lys Gly Tyr Thr Arg Leu 
35 Val 

145 150 155 

160 

Lys He Leu Val Ser Asn Gly Thr Asp Val Asn Leu Lys Asn Gly 
Ser 

40 165 170 175 

Gly Lys Asp Ser Leu Met Leu Ala Cys Tyr Ala Gly His Leu Asp 
Val 

180 185 190 

Val Lys Tyr Leu Arg Arg His Gly Ala Ser Trp Gin Ala Arg Asp 
45 Leu 

195 200 205 

Gly Gly Cys Thr Ala Leu His Trp Ala Ala Asp Gly Gly His Cys 
Ser 

210 215 220 

50 Val He Glu Trp Met He Lys Asp Gly Cys Glu Val Asp Val Val 
Asp 
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225 230 235 

240 

Thr Gly Ser Gly Trp Thr Pro Leu Met Arg Val Ser Ala Val Ser 
Gly 

5 245 250 255 

Asn Gin Arg Val Ala Ser Leu Leu lie Asp Ala Gly Ala Asn Val 
Asn 

260 265 270 

Val Lys Asp Arg Asn Gly Lys Thr Pro Leu Met Val Ala Val Leu 
10 Asn 

275 280 285 

Asn His Glu Glu Leu Val Gin Leu Leu Leu Asp Lys Gly Ala Asp 
Ala 

290 295 300 

15 Ser Val Lys Asn Glu Phe Gly Lys Gly Val Leu Glu Met Ala Arg 
Val 

305 310 315 

320 

Phe Asp Arg Gin Val Gly Met Leu Phe Leu Pro lie Lys Ala Asn 
20 Phe 

325 330 335 



25 



<210> 19 

<211> 621 

<212> PRT 

30 <213> Homo sapiens 



<400> 19 

Met Ala Leu Gly Leu Glu Gin Ala Glu Glu Gin Arg Leu Tyr Gin 
35 Gin 

15 10 15 

Thr Leu Leu Gin Asp Gly Leu Lys Asp Met Leu Asp His Gly Lys 
Phe 

20 25 30 

40 Leu Asp Cys Val Val Arg Ala Gly Glu Arg Glu Phe Pro Cys His 
Arg 

35 40 45 

Leu Val Leu Ala Ala Cys Ser Pro Tyr Phe Arg Ala Arg Phe Leu 
Ala 

45 50 55 60 

Glu Pro Glu Arg Ala Gly Glu Leu His Leu Glu Glu Val Ser Pro 
Asp 

65 70 75 

80 

50 Val Val Ala Gin Val Leu His Tyr Leu Tyr Thr Ser Glu He Ala 
Leu 

85 90 95 
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Asp Glu Ala Ser Val Gin Asp Leu Phe Ala Ala Ala His Arg Phe 
Gin 

100 105 110 

lie Pro Ser lie Phe Thr lie Cys Val Ser Phe Leu Gin Lys Arg 
5 Leu 

115 120 125 

Cys Leu Ser Asn Cys Leu Ala Val Phe Arg Leu Gly Leu Leu Leu 
Asp 

130 135 140 

10 Cys Ala Arg Leu Ala Val Ala Ala Arg Asp Phe lie Cys Ala His 
Phe 

145 150 155 

160 

Thr Leu Val Ala Arg Asp Ala Asp Phe Leu Gly Leu Ser Ala Asp 
15 Glu 

165 170 175 

Leu lie Ala lie lie Ser Ser Asp Gly Leu Asn Val Glu Lys Glu 
Glu 

180 185 190 

20 Ala Val Phe Glu Ala Val Met Arg Trp Ala Gly Ser Gly Asp Ala 
Glu 

195 200 205 

Ala Gin Ala Glu Arg Gin Arg Ala Leu Pro Thr Val Phe Glu Ser 
Val 

25 210 215 220 

Arg Cys Arg Leu Leu Pro Arg Ala Phe Leu Glu Ser Arg Val Glu 
Arg 

225 230 235 

240 

30 His Pro Leu Val Arg Ala Gin Pro Glu Leu Leu Arg Lys Val Gin 
Met 

245 250 255 

Val Lys Asp Ala His Glu Gly Arg lie Thr Thr Leu Arg Lys Lys 
Lys 

35 260 265 270 

Lys Gly Lys Asp Gly Ala Gly Ala Lys Glu Ala Asp Lys Gly Thr 
Ser 

275 280 285 

Lys Ala Lys Ala Glu Glu Asp Glu Glu Ala Glu Arg lie Leu Pro 
40 Gly 

290 295 300 

lie Leu Asn Asp Thr Leu Arg Phe Gly Met Phe Leu Gin Asp Leu 
He 

305 310 315 

45 320 

Phe Met He Ser Glu Glu Gly Ala Val Ala Tyr Asp Pro Ala Ala 
Asn 

325 330 335 

Glu Cys Tyr Cys Ala Ser Leu Ser Asn Gin Val Pro Lys Asn His 
50 Val 

340 345 350 
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Ser Leu Val Thr Lys Glu Asn Gin Val Phe Val Ala Gly Gly Leu 
Phe 

355 360 365 

Tyr Asn Glu Asp Asn Lys Glu Asp Pro Met Ser Ala Tyr Phe Leu 
5 Gin 

370 375 380 

Phe Asp His Leu Asp Ser Glu Trp Leu Gly Met Pro Pro Leu Pro 
Ser 

385 390 395 

10 400 

Pro Arg Cys Leu Phe Gly Leu Gly Glu Ala Leu Asn Ser lie Tyr 
Val 

405 410 415 

Val Gly Gly Arg Glu lie Lys Asp Gly Glu Arg Cys Leu Asp Ser 
15 Val 

420 425 430 

Met Cys Tyr Asp Arg Leu Ser Phe Lys Trp Gly Glu Ser Asp Pro 
Leu 

435 440 445 

20 Pro Tyr Val Val Tyr Gly His Thr Val Leu Ser His Met Asp Leu 
Val 

450 455 460 

Tyr Val lie Gly Gly Lys Gly Ser Asp Arg Lys Cys Leu Asn Lys 
Met 

25 465 470 475 

480 

Cys Val Tyr Asp Pro Lys Lys Phe Glu Trp Lys Glu Leu Ala Pro 
Met 

485 490 495 

30 Gin Thr Ala Arg Ser Leu Phe Gly Ala Thr Val His Asp Gly Arg 
He 

500 505 510 

He Val Ala Ala Gly Val Thr Asp Thr Gly Leu Thr Ser Ser Ala 
Glu 

35 515 520 525 

Val Tyr Ser He Thr Asp Asn Lys Trp Ala Pro Phe Glu Ala Phe 
Pro 

530 535 540 

Gin Glu Arg Ser Ser Leu Ser Leu Val Ser Leu Val Gly Thr Leu 
40 Tyr 

545 550 555 

560 

Ala He Gly Gly Phe Ala Thr Leu Glu Thr Glu Ser Gly Glu Leu 
Val 

45 565 570 575 

Pro Thr Glu Leu Asn Asp He Trp Arg Tyr Asn Glu Glu Glu Lys 
Lys 

580 585 590 

Trp Glu Gly Val Leu Arg Glu He Ala Tyr Ala Ala Gly Ala Thr 
50 Phe 

595 600 605 

Leu Pro Val Arg Leu Asn Val Leu Arg Leu Thr Lys Met 

85 
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610 615 



5 <210> 20 

<211> 469 
<212> PRT 
<213> Homo sapiens 

10 

<400> 20 

Met Asp Leu Gin Pro Asn Lys Gin Lys Asp 
Arg 

15 10 
15 Gin Ala Gly Ser Val Gly Gly Leu Gin Trp 
Arg 

20 25 
Leu Glu Thr Glu Ala Ser Thr Gly Gin Gin 
Leu 

20 35 40 

Ser Ser Pro Phe Asp Leu Asn Phe Pro Leu 
Pro 

50 55 
Ala Cys Leu Val Lys Val Tyr Glu Asp Trp 
25 Asn 

65 70 

80 

Asp lie Leu Glu Leu Tyr Gly He Leu Ser 
Ser 

30 85 90 

He Leu Asn Asn Asp Glu Arg Asp Ala Ser 
Met 

100 105 
Glu Cys Thr Asp Thr Ala Glu Glu Gin Arg 
35 Ala 

115 120 
Ser Leu Val Pro Arg He His Val He Leu 
His 

130 135 
40 He Asn Pro Leu Leu Pro Ala Cys Leu Asn 
Thr 

145 150 
160 

Cys Lys Phe Val Ser Ser Phe Met Ser Glu 
45 Ala 

165 170 
Glu Leu Leu Gly Phe Leu Thr His Ala Leu 
Ala 

180 185 
50 Ala Glu Tyr Leu He Leu His Leu He Ser 
Arg 

195 200 

86 
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620 



Gin His Ala Gly Ala 

15 

Cys Gly Glu Pro Lys 
30 

Leu Asn Ser Leu Asn 
45 

Pro Gly Glu Lys Gly 
60 

Asp Cys Phe Lys Val 
75 

Val Asp Pro Val Leu 

95 

Ala Leu Leu Asp Pro 
110 

Val His Ser Pro Pro 
125 

Ala Gin Lys Leu Gin 
140 

Lys Glu Glu Ser Lys 
155 

Leu Ser Pro Val Arg 
175 

Leu Gly Asp Ser Leu 
190 

Thr Val Tyr Thr Arg 
205 
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Asp Val Leu Pro Leu Gly Lys Phe Thr Val Asn Leu Ser Gly Cys 
Pro 

210 215 220 

Arg Asn Ser Thr Phe Thr Glu His Leu Tyr Arg lie lie Gin His 
5 Leu 

225 230 235 

240 

Val Pro Ala Ser Phe Arg Leu Gin Met Thr lie Glu Asn Met Asn 
His 

10 245 250 255 

Leu Lys Phe lie Pro His Lys Asp Tyr Thr Ala Asn Arg Leu Val 
Ser 

260 265 270 

Gly Leu Leu Gin Leu Pro Ser Asn Thr Ser Leu Val lie Asp Glu 
15 Thr 

275 280 285 

Leu Leu Glu Gin Gly Gin Leu Asp Thr Pro Gly Val His Asn Val 
Thr 

290 295 300 

20 Ala Leu Ser Asn Leu lie Thr Trp Gin Lys Val Asp Tyr Asp Phe 
Ser 

305 310 315 

320 

Tyr His Gin Met Glu Phe Pro Cys Asn lie Asn Val Phe lie Thr 
25 Ser 

325 330 335 

Glu Gly Arg Ser Leu Leu Pro Ala Asp Cys Gin lie His Leu Gin 
Pro 

340 345 350 

30 Gin Leu lie Pro Pro Asn Met Glu Glu Tyr Met Asn Ser Leu Leu 
Ser 

355 360 365 

Ala Val Leu Pro Ser Val Leu Asn Lys Phe Arg lie Tyr Leu Thr 
Leu 

35 370 375 380 

Leu Arg Phe Leu Glu Tyr Ser lie Ser Asp Glu lie Thr Lys Ala 
Val 

385 390 395 

400 

40 Glu Asp Asp Phe Val Glu Met Arg Lys Asn Asp Pro Gin Ser lie 
Thr 

405 410 415 

Ala Asp Asp Leu His Gin Leu Leu Val Val Ala Arg Cys Leu Ser 
Leu 

45 420 425 430 

Ser Ala Gly Gin Thr Thr Leu Ser Arg Glu Arg Trp Leu Arg Ala 
Lys 

435 440 445 

Gin Leu Glu Ser Leu Arg Arg Thr Arg Leu Gin Gin Gin Lys Cys 
50 Val 

450 455 460 

Asn Gly Asn Glu Leu 
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5 <210> 21 

<211> 473 
<212> PRT 

<213> Homo sapiens 

10 

<400> 21 

Met Cys Ser Gly Leu Leu Glu Leu Leu Leu Pro lie Trp Leu Ser 
Trp 

15 10 15 

15 Thr Leu Gly Thr Arg Gly Ser Glu Pro Arg Ser Val Asn Asp Pro 
Gly 

20 25 30 

Asn Met Ser Phe Val Lys Glu Thr Val Asp Lys Leu Leu Lys Gly 
Tyr 

20 35 40 45 

Asp lie Arg Leu Arg Pro Asp Phe Gly Gly Pro Pro Val Cys Val 
Gly 

50 55 60 

Met Asn lie Asp lie Ala Ser He Asp Met Val Ser Glu Val Asn 
25 Met 

65 70 75 

80 

Asp Tyr Thr Leu Thr Met Tyr Phe Gin Gin Tyr Trp Arg Asp Lys 
Arg 

30 ~ 85 90 95 

Leu Ala Tyr Ser Gly He Pro Leu Asn Leu Thr Leu Asp Asn Arg 
Val 

100 105 110 

Ala Asp Gin Leu Trp Val Pro Asp Thr Tyr Phe Leu Asn Asp Lys 
35 Lys 

115 120 125 

Ser Phe Val His Gly Val Thr Val Lys Asn Arg Met He Arg Leu 
His 

130 135 140 

40 Pro Asp Gly Thr Val Leu Tyr Gly Leu Arg He Thr Thr Thr Ala 
Ala 

145 150 155 

160 

Cys Met Met Asp Leu Arg Arg Tyr Pro Leu Asp Glu Gin Asn Cys 
45 Thr 

165 170 175 

Leu Glu lie Glu Ser Tyr Gly Tyr Thr Thr Asp Asp He Glu Phe 
Tyr 

180 185 190 

50 Trp Arg Gly Gly Asp Lys Ala Val Thr Gly Val Glu Arg He Glu 
Leu 

195 200 205 
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Pro Gin Phe 
Val 

210 

Phe Ala Thr 
5 Lys 

225 
240 ' 

Arg Asn lie 
Leu 

10 

He Thr He 
Ser 

Ala Ala Arg 
15 Thr 

275 

lie Asn Thr 
Lys 

290 

20 Ala He Asp 
Ala 

305 
320 
Leu Leu Glu 
25 Pro 

Gin Arg Gin 
Asp 

30 Arg Ser Lys 
Leu 

355 

Leu Thr Ser 
Gly 

35 370 

He Gly Asp 
He 

385 
400 

40 Gin Tyr Arg 
Leu 

Gly Asp Arg 
Ser 

45 

Ser Gin Leu 
He 

435 

Asp Arg Trp 
50 Asn 

450 
Leu Val Tyr 



Ser He Val Glu 
215 

Gly Ala Tyr Pro 
230 

Gly Tyr Phe He 
245 

Leu Ser Trp Val 
260 

Val Ala Leu Gly 

His Leu Arg Glu 
295 

Met Tyr Leu Met 
310 

Tyr Ala Phe Val 
325 

Lys Lys Leu Ala 
340 

Ser Glu Ser Asn 

Leu Glu Val His 
375 

Thr Arg Asn Ser 
390 

Lys Gin Ser Met 
405 

Ser Leu Pro His 
420 

Lys He Lys He 

Ser Arg He Val 
455 

Trp Leu Tyr Tyr 



His Arg Leu Val 

Arg Leu Ser Leu 
235 

Leu Gin Thr Tyr 
250 

Ser Phe Trp He 
265 

He Thr Thr Val 
280 

Thr Leu Pro Lys 

Gly Cys Phe Val 
315 

Asn Tyr He Phe 
330 

Glu Lys Thr Ala 
345 

Arg Val Asp Ala 
360 

Asn Glu Met Asn 

Ala He Ser Phe 
395 

Pro Arg Glu Gly 
410 

Lys Lys Thr His 
425 

Pro Asp Leu Thr 
440 

Phe Pro Phe Thr 

Val Asn 
89 
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Ser Arg Asn Val 

220 

Ser Phe Arg Leu 



Met Pro Ser He 
255 

Asn Tyr Asp Ala 
270 

Leu Thr Met Thr 
285 

He Pro Tyr Val 
300 

Phe Val Phe Leu 



Phe Gly Arg Gly 
335 

Lys Ala Lys Asn 
350 

His Gly Asn He 
365 

Glu Val Ser Gly 
380 

Asp Asn Ser Gly 



His Gly Arg Phe 
415 

Leu Arg Arg Arg 
430 

Asp Val Asn Ala 
445 

Phe Ser Leu Phe 
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5 <210> 22 

<211> 156 
<212> PRT 
<213> Homo sapiens 

10 

<400> 22 

Met Phe Leu Thr Ala Val Asn Pro Gin Pro Leu Ser Thr Pro Ser 
Trp 

15 10 15 

15 Gin lie Glu Thr Lys Tyr Ser Thr Lys Val Leu Thr Gly Asn Trp 
Met 

20 25 30 

Glu Glu Arg Arg Lys Phe Thr Arg Asp Thr Asp Lys Thr Pro Gin 
Ser 

20 35 40 45 

lie Tyr Arg Lys Glu Tyr lie Pro Phe Pro Asp His Arg Pro Asp 
Gin 

50 55 60 

lie Ser Arg Trp Tyr Gly Lys Arg Lys Val Glu Gly Leu Pro Tyr 
25 Lys 

65 70 75 

80 

His Leu lie Thr Pro His Gin Glu Pro Pro His Arg Tyr Leu lie 
Ser 

30 85 90 95 

Thr Tyr Asp Asp His Tyr Asn Arg His Gly Tyr Asn Pro Gly Cys 
Leu 

100 105 110 

His Ser Ala Leu Gly Met Asp Arg Ser Cys Cys Gly Cys Gin Arg 
35 Ser 

115 120 125 

Leu Thr Phe Pro Phe Leu Leu Pro Leu Gin Thr Met Asp Ser Met 
Ser 

130 135 140 

40 Ser Ser Ser Arg Asp Ser Ser His Pro Arg Leu Ala 
145 150 155 



45 <210> 23 

<211> 2631 

<212> DNA 

<213> Homo sapiens 

50 <220> 

<221> misc_feature 

<222> (1) . . . (2631) 
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<400> 23 

5 gcggccgccg ccacgtacat gaagtgcagc tgcgccggcg ggggcggctc 
cacggtctca 60 

ctctgcacgg cctcgatctt gtaggggatg ttgaggctgc ccagcgaggc 
gagcgctccc 120 
aggaatgcgg ccacgtcggt ggcactctgg aagcactgcg aggaggcctg 
10 cacacactgc 180 

cggttgtcaa tctccaggta gacgatggag ccgcggacgt ccatggggtc 
cagctccctc 240 

cgccgccgcc caccctcgct gccaccaggg agcagcgagg ccttcacctg 
gcccagcagg 300 
15 gcgtcaggtg cggcccagcc ctcggcggca cgcttgatgg ggtgcttgcg 
cagctcctcc 360 

tcgcggccgt agtaagggga agatcatctg ctggccgctg agcgtcacgc 
tttgaagacc 420 
acgttggtgt tgcagcacgc gggcttgaac tcccgcagga aagtggaaag 
20 gagctgttgc 480 

gcagctgctc cggcggcatc agcaccacca ctcaccgcgt gccggccgcn 
cagcctctcg 54 0 

ggtacatgct ccgcacagtc cagcccgtcc cactcgcact ccgcgctgtt 
gcagccctgg 6 00 
25 tcgcagtgcc cgtcgctgaa gtggtccttg cagtactggt cgtacagggg 
gttgcactgg 660 

ccttccgcac gctggcagtc aaagccgtcg aagaggcagc cggctgagtt 
gcactggctg 720 
tcacagtggc cgtcactgaa gtacttccag cactgcagag actgcgtgca 
30 gttcttccag 780 

gggtcattga agttgaggga gcagtcaccg ccgtcccagc cgcacgcgtg 
gttgttgcac 840 

tgcaggctgc agaccttgtt gcccgcgtcc tcctggcact cgggcagctc 
gcacgcctcc 900 
35 tcgatcagcg gcggggggat gtcgcgcccg gccccacccc cgaagctgta 
gtccaggatg 960 

tggcacaaga gcccgttgaa tttggcgggg cacaggcaac ggtagaaggg 
gctctcggat 1020 
gtgggctcac aggtcccctg gttgtagcag gggttgccgc ccaggcaggg 
40 gctgctggcc 108 0 

gggaactggc attcggggcc cgtgaagggg cccaggcaca ggcaggtggg 
gctgcgcggg 114 0 

ccggagatgc atgtgccgcc gttgaggcag cgcaggctgc cgcaggtacg 
agcgtcattc 1200 
45 tcacacgtgg cgccctcgaa gcccgcaggg cacttgcaga tgaaacccgc 
gggcggtgtt 12 e 0 

ggaggccacg gcgcaggtgc ccccattctt gcagggcttg cctttgcagc 
cattgatgac 1320 
ggactcgcag cggcgcccgg tgtgaccagc acggcactcg cagtggaagt 
50 cattgacgcg 1380 

ctgcacgcag ttctgggtgc cacgggcgtc gcagggattg gacaggcact 
cgttgacatc 1440 
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cccctcacag cgctcaccca cgaagcccgg cgggcaggtg cagctgtagc 
cgcccacctg 1500 

gtccacgcag gtgccgttgt taaagcactt ggggctccgg gacacggggt 
caacgggggg 156 0 
5 attgcagtcg tccacgttga tctcacagtg cacaccctga gtgccccgtg 
ggcaggagca 162 0 

cttgtaggtg ttggggaggt cgaggcaggt gcccccgttc tggcaggggt 

gggagaggca 1680 

ctcgtcgatc tcctcagagc agttcacccc gtggtagccg gccacgcact 
10 tgcaggagta 174 0 

gccgcccagg tagtccgtgc aggtggcccc gttctggcag gggctgggtg 
agcactcgtc 1800 

caccaggtcc tcacagtagc tgcctgtgta gcccgcctgg cagcggcagt 
ggtgcgtgtt 186 0 
15 gcccgcgtcc acacagagcc ctccatgctg gcacaggcgg gcaacgtcaa 
caccttgtcg 1920 

ctgcgcagcc acctcacagg acacgctggg cacgtcgcag taaaggccgg 
tccagccgct 1980 
ggggcactcg cagcggtact gggtgtgggt ctgccagcat ttgccgccgt 
20 tcttgcaggg 204 0 

cgaggagtca caccagtgca caaggttctg gcagttgggg ccagtgtagc 
cctgggggca 2100 

ggtgcacctg tggagaccgc gaccgtcctg acaggtgccg cctagcaggc 
agggtcgtga 2160 
25 gtcgcactca ttgactacgt gctggcagta gctgcccgtg aagccgggtg 
gacacaggca 2220 

ggtgaacgag ttgatgccgt ccacgcaggt gccaccgttg aagcaggagc 
tctctgtgca 2280 
gtcaggcgtg ttgttctcac agtggatccc gctgaagcct gcggggcagg 
30 tgcacgtgta 2340 

gctgtccacg cagtccgtgc agttggcccc gttgcggcag gggtcactgg 
cacactcgtt 2400 

gatgtcctcc tcacagaaag tgccccggaa gccgggcagg cagtcgcaga 
aggccgtgtt 2460 
35 gatgccgtct gtgcaggagc ccccgttgtg acacgggttg ggccggcagt 
cgtcgatgtc 2520 

ggtctcgcag ttgcgcccac tgtagccggc ctggcagtgg cagcggtagc 
cgccgtgggt 2580 
gttctggcag gatgcgccgt gccggcacgg gctcagaacg cactcgttga t 
40 2631 



<210> 24 
45 <211> 3269 

<212> DNA 
<213> Homo sapiens 



50 <400> 24 

tttttttttt ttactcaatg ctcagggttt atttgcaaaa ctgggctggg 
ttgagcagag 6 0 
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ggggctttgg gatgctgacc cctctctgac ctggccgttt cgacctcagc 
cctgagttac 120 

cctgcagcag gcatggacag gcggccgcct tgcctgtcgc agcttcctgg 
cagcagctgg 180 
5 ccttgcctgg accagcctgt ctgccccagc ggccggacgt ggccagctgg 
aatccgtgct 240 

gcgcgggcgc tccggccgtg gcagggacac ataggcccgc acatgcttag 
tttgttccag 300 

ggcgtcccag tgacatcagg gaggggcaag gccaaggtgg gccacaggct 
10 ctcccagggt 360 

aggactcctg agtggcgccc tcccgaaagg ggtcctgctg gcccctcacc 
cttctggggc 420 

cggtttcacc cacggggtgg gcagcaggag cacggagcgc ggggtgccac 
acacaggccg 480 
15 gggggctccg cagaggaggc ccaaggtttt catgatcaga caatgcacat 
cctggctcca 54 0 

ggtggacaga gcatggggat ggaggcagaa cgttctcaga cagcagtggc 
ctcctgggcc 600 

gggaagccac gggtgtgtgg gggagccatg ggcgcaggga cacagaaggg 
20 cgtgcagcca 660 

cactcatcca cgtgggtgta ggtgtgcagg atggctgagc cgttaggaca 
gtgcaagggc 720 

accgtctcct cgtggacccg cctctcctgg cagcaggtgc actggtgctg 
catggcctgg 780 
25 gcctcttgct gagtacttgg acgcttccgg ggcaggagcc cctcgcagaa 
ggtgatgttg 84 0 

acctcggtct cgcagccctg gtgccacagg gatggtcgtg ttgatgcgga 
cttggacagg 900 

agtcctcctc acaggagtag cagcagccgg ttttcctgag gctccccctg 
30 cagctggaca 960 

catctgggca ggatgcaggc tgtggggtca gcaaatggac tccaccctca 
gcctcacaga 1020 

ggtacacggt gcagttgtcc acatggctgt tgacccaggt ttcattcagc 
tggactggct 1080 
35 ggccatcggg cgtgaggcag gcggtctgga cgcactcccc acagcactgc 
ccggccactc 1140 

tcttgtactc aaagccctgg ggacaggtag tattgttgca ggcatcctcc 
tgacattgca 1200 

ccgttgggtc ctgggtgtcc ccagagaggc aggtacacat gtggcaggga 
40 agggcgcctg 1260 

ggaaggttgc accaaccccg tagaaggtgc cattgtacga acacagctga 
ggtctgcagc 1320 

ggaaggtggg acagcagtcg ccctcctcct gggtgcagat ggactcctgc 
cctggcgggc 13 80 
45 acacaggcag gctctggggg caggtggttg tgttgcacac gcacaccgtc 
tcggggcagc 144 0 

aggggttctc ggcccggggc ctggtcacgg ttacgaagcc gggacggttg 
cacggcgggg 1500 

gctgaccctg ggcgtcacag ggcaggggct tgcactgcac cgacactgaa 
50 ccctcgtcac 1560 

acacgcagga ctggcagttg ctgacccacc gctccccggg aaatttagga 
aacccatcgg 1620 . 
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gtcccacgca ggggcaggcc 
aggatctggt 1680 

cctcagggca gaagcagccc 
ttcctagagt 1740 
5 tgcaggtggc aggctgtatg 
gggcaggtga 18 00 

ggtcgcacag gccaccggtt 
gcgcggcaga 1860 

gctctgcgta agcctccagg 
10 cagtggtcgc 1920 

tgatgcaggc gttgaagaat 
gcaaagacct 1980 

ggctcagcat cagatcacag 
ggtgtgctag 2040 
15 acaccggggc tgcggggctg 
cagccatcct 2100 

ttctgctgtc ggggaccagc 
gcagtggttc 2160 

cgtcccgctg gagacagtcg 
20 tggccctcgg 2220 

tgttgttgtg gaagaggctg 
ttgaaggtga 2280 

cgctcacgcc cagggcagga 
acagacacaa 234 0 
25 gcacgccgtt cttgctgaaa 
atcaggccct 2400 

cctccttccc atgcaccatg 
tagtggatgc 2460 

tgagggccgc gggggcagcg 
30 gtagtggttg 252 0 

tccaggtaga ggctgagatt 
gacataggtg 2580 

cagttgcccc ggaaggtgta 
gccgccccac 264 0 
35 atgctgcaga tgcactcgca 
gtccgacact 2700 

ttgatgggca ggtgcttgtt 
gtccagcagg 276 0 

acgacacggt tgtcacccac 
40 ccaggtctca 2820 

ttcacctgcc ggagagggat 
cgaggacagc 2 880 

ggggcggagg acactggcgg 
gtcaatgtca 2940 
45 cagtgctgat tgcacactgc 
attgtagatg 3000 

acttccccgg gcgagaaaaa 
agaaaagtgg 3060 

gagctgggga agccagtggg 
50 agtggacacg 3120 

ctgaaggttg gcaggctgct 
cacggtggac 3180 



tgcacgcaga 


tigcccauy uy 


cgcgccgaag 


tccgccatcc 


4" /«i -a /">r /-» 4- /-«f 

CCtcCayCLy 


4— r/-*f**~* 4" y~i 4- /v(>f 

tgggctctgg 


gggccgcatg 


gcc tgtacac 


l uuggcgggi: 


gcacctcgcc 


(nrj- ^>t O /*"» 4" (~T f~l "3 

agLCdt. UyCa 


CaCtccccyy 


ctcuggcagg 


y CaUL. LLaay 


y^yyLLLL uy 


gggcccgggg 


gcagaagggu 


yLyyCaCtCa 


agcagctgtg 


gtgggcatgg 


ggtgggggtg 


gcagtggggg 


gtgtgccagc 


cggggcccaa 


cacgtcttgg 


4~ 4- /"» /-i 4— 4— 

ccacgccccu 


gcaaccggcg 


tccctccggc 


cgccggcgca 


ggcgccgcac 


tagggcagcc 


gggcctggaa 


cacu uggcca 


atgtccacag 


ccatggtggt 


ggcccccagc 


ccgctgctca 


ccggaattgg 


gccaaacagg 


gtgacagtga 


ggacgatatc 


cauggacuug 


ggcggcagcg 


gcagcggcag 


aggccgcgca 


cccaaagcgc 


gcacggauct 


/-I 4— /—I 4— /ri ^ 4— -5 y-«f 

ctctCauyay 


ctgsiggtgccg 


ccaaaggcgg 


aaLaytyyya 


cccacagcgg 


aagucacagg 


gctggcccgg 


cacgcaggtg 


acgccggcca 


caggcucrgg 


gcacccggcc 


accgugcagu 


uCUCCdyyyL 


ggcac cguca 


cagccagggg 


/-« ra rrrrnrra cmct 

ca yygg a yyy 


tggggaggtg 


ggacaggcgc 


cc uggaagcg 


gtagaaatgg 


cagccggctc 


ggrcggccuu 


ctgtccaaat 


gccctgcaga 


agcagggagt 


tctcagggtg 


gtgaggactg 


aggaggacac 


ggggagcact 


gcaggggtgc 


gggtggtccc 
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gagctgggaa ccgtggaggc agtggctgtg ggcatagtgg ccatggtggt 
caccactttg 3240 

ggggtgtgag ctgttcccag agtggagga 
3269 



<210> 25 
<211> 920 
10 <212> DNA 

<213> Homo sapiens 



<400> 25 

15 tttttttttg tgagaaaagc cgttgaggtc ctctcaaatc tgcatgtgag 
caagtaggtc 60 

tacaacaggc aaacatactg cttcccgact tctgaaaagt caaggaatcc 
taagtaatct 120 
agatgaaaaa gtgactcctt ggtttcttca cattcccctc caagagccca 
20 gtgagccatc 180 

agctcctggc cagcaacaca cctgatacaa aatgggagaa gtgtataaat 
tattatgttg 240 

ataagcaaga taagtgccaa agggtagata agagtgaaag cattgggaag 
cctggcagag 3 00 
25 tgcagtcagc aggtcagttc ctggcggttc attgagcagc ttggcccaaa 
gatgtaagtg 3 60 

aagtccctcc tggtggaggg atgagagcag ctccagttac tcggccttca 
tgacacagga 420 
agttgaaggt ggcacaggca ttgggcgtgt cctgcacttc cacagcaatg 
30 ccccgctgcc 480 

tcatggcttg aagcacctgg gactgcagcc tctcggtccg gtctccagtc 
cccaccacca 540 

cgatctctat ccggggctcc agcaaccaga agagggaaaa gctgtcttcg 
gtgatgtcct 600 
35 ggtgggatcc cacgttccac tgcaccaccg agtgcgggag cagagcgcag 
gggccgagca 660 

cgcggtttcc gtttatcatg aagccgcggc tgttgtagct gtcgatgtac 
attgcctgag 720 
cggcctcgcg ttgcagcaga gagatgcgcg tccgctgata cagctcgtca 
40 tccgccggcg 780 

agagccgatg ccctcgccgc ggggcccagg gaagctcaac gggcggacag 
cgcagcgagg 84 0 

gtcgcgctcg gtacaagcta cgtagcgcga gagcggtggc catggctgac 
caacgcgcgg 900 
45 cgaagtcgtc accggcgccg 
920 



50 <210> 26 

<211> 428 
<212> DNA 
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<213> Homo sapiens 





<400> 


26 








5 


caaagggtta 


ggaggtctct 


gagaaatgag ggagccacct 


agacaggggt 




gagctcctta 


60 










tcatatttct 


gcatgccctc 


gtcagggagc 


aggacctcca 


ctgtgaagct 




gaccttcttg 


120 










gcgtggacaa 


agctataggt 


gttgtcgaag 


cgtaggacat 


ctgggggaca 


10 


gatggaaaga 


180 










ggggcattgg 


gcttggcttt 


acacactctt 


ttttccaccc 


ctggccatgg 




aggaggacag 


240 










gcttgtgttc 


ccactggctt 


tgtttgccaa 


gctgggccaa 


gctcccacca 




cgcctcttcc 


-> n a 
3 OU 








15 


cttccccatg 


ggaactgcca 


acccaagcag 


gtcctgtaaa 


cagacaccaa 




cctcctcttg 


360 










gcatgtgccc 


aggcatggct 


ttactggcat 


cgagaacaat 


ggtcattaca 




tcccattgtc 


420 










tcacacct 










20 


428 












<210> 


27 








25 


<211> 


379 










<212> 


DNA 










<213> 


Homo sapiens 






30 


<400> 


27 










ggcacgagcg 


acgagggcga 


cccctctgag 


aagctggagc 


tggtgacagg 




caccaacgtg 


60 










tacatcacaa 


gggcgcagct 


gatgaactgc 


cacgtcagcg 


caggcacgcg 




gcacaaggtc 


120 








35 


ctactgcggc 


ggctcctggc 


ctccttcttt 


gaccggaaca 


cgctggccaa 




cagctgcggc 


180 










accggcatcc 


gctcttctac 


caacgatccc 


cgtcggaagc 


ccctggacag 




ccgcgtgctc 


240 










cacgctgtca 


agtactactg 


ccagaacttc 


gcccccaact 


tcaaggagag 


40 


cgagatgaat 


300 










gccatcgcgg 


ccgacatgtg 


caccaacgcc 


cgccgcgtcg 


tgcgcaagag 




ctggatgccc 


360 










aaggtcaagg 


tgctcaagg 










379 










45 














<210> 


28 










<211> 


562 








50 


<212> 


DNA 









<213> Homo sapiens 
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<400> 28 

caattttttt ctcttttctt aaggtatcag atacacaccg gacttcaaca 
ttctatcata 60 
5 agacctaccc aacccaactg tttacctctg gacaatgcca ccctacctca 
gaaactgaag 120 

gaggttggat attcaacgca tatggtcgga aaatggcact tgggttttta 
cagaaaagaa 180 

tgcatgccca ccagaagagg atttgatacc ttttttggtt cccttttggg 
10 aagtggggat 240 

tactatacac actacaaatg tgacagtcct gggatgtgtg gctatgactt 
gtatgaaaac 300 

gacaatgctg cctgggacta tgacaatggc atatactcca cacagatgta 
cactcagaga 3 60 
15 gtacagcaaa tcttagcttc ccataacccc acaaagccta tatttttata 
tattgcctat 420 

caagctgttc attcaccact gcaagctcct ggcaggtatt tcgaacacta 
ccgatccatt 480 

atcaacataa acaggaggag atatgctgcc atgctttcct gcttagatga 
20 agcaatcaac 540 

aacgtgacat tggctctaaa ag 
562 



<210> 29 

<211> 577 

<212> DNA 

<213> Homo sapiens 

30 

<400> 29 

cacagacgtg aatctgaaga atggaagtgg caaggacagt ctaatgctgg 
cgtgctatgc 60 
35 gggacaccta gatgttgtga aatatctccg aagacatggc gcttcttggc 
aggctagaga 120 

cctgggaggc tgtacagctc tgcactgggc tgcagatgga ggccactgca 
gtgtgattga 180 

gtggatgata aaggatggct gtgaggtaga cgtcgtggac actggttcag 
40 gatggacccc 240 

actcatgaga gtctctgcgg tgtcgggaaa tcagagggtg gcctctcttc 
taattgatgc 300 

tggggccaat gtgaatgtga aggacagaaa tggaaagacg ccccttatga 
gtgtagtctc 360 
45 cttattagaa gaaaggaaaa aaaagcagag gccaaagaag tcttgtgtct 
gctgatgaga 420 

gcaccactca tctgcgaaac gcacgtaaaa caaagtgaac cgtgactgtt 
aaactaggga 480 

tgggaaattc tgcatcttgg ggggctgtac atttatttat ttagttgaag 
50 attcactgat 540 

cccactttga aatacccctt tttacctaaa aaaaaaa 
577 
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<210> 


30 








5 


<211> 


1083 










<212> 


DNA 










<213> 


Homo sapiens 






10 


<400> 


30 










attcccgggt 


cgacggctgc 


gagaagacga 


cagaagggga 


aggatggagc 




cggggccaag 


60 










gaggctgata 


agggcacaag 


caaagccaaa 


gcagaggagg 


atgaggaggc 




cgaacgtatc 


120 








15 


cttcctggga 


tcctcaatga 


caccctgcgc 


ttcggcatgt 


tcctgcagga 




tctcatcttc 


180 










atgatcagtg 


aggagggcgc 


tgtggcctac 


gatccagcag 


ccaacgagtg 




ctactgtgct 


240 










tccctctcca 


gccaggtccc 


caagaaccac 


gtcagcctgg 


ttaccaagga 


20 


gaaccaggtc 


300 










ttcgtggctg 


gaggcctctt 


ctacaacgaa 


gacaacaaag 


aggaccccat 




gagcgcatac 


360 










ttcctgcagt 


ttgaccatct 


ggactcagag 


tggctgggga 


tgccaccgct 




gccctcgccc 


420 








25 


cgctgcctct 


ttggcctggg 


agaagctctc 


aactccatct 


acgtggtcgg 




tggcagagag 


480 










atcaaggacg 


gcgagcgctg 


cctggactcg 


gtcatgtgct 


acgacaggct 




gtcattcaaa 


540 










tggggtgaat 


cggacccgct 


gccttacgtg 


gtgtatggcc 


acacagtgct 


30 


ctcccacatg 


600 










gaccttgtct 


acgtaattgg 


cggcaaaggc 


agtgacagga 


agtgcctgaa 




caagatgtgc 


660 










gtctatgacc 


ccaagaagtt 


tgagtggaag 


gagctggcac 


ccatgcagac 




cgcccgctca 


720 








35 


ctctttgggg 


ccactgtcca 


tgatggccgc 


attatcgtgg 


cagctggggt 




caccgacaca 


780 










gggctgacca 


gttctgccga 


agtgtacagc 


atcacagaca 


acaagtgggc 




acccttcgag 


840 










gccttcccac 


aggagcgtag 


ctcactcagc 


ctggtcagcc 


tggtgggtac 


40 


cctctatgcc 


900 










attggtggct 


ttgccacact 


ggagacggag 


tctggagagc 


tggttcccac 




agagctcaat 


960 










gacatctgga 


ggtataacga 


ggaggagaag 


aaatgggagg gtgtcctgcg 




ggagatcgcc 


1020 








45 


tatgcagcag 


gtgccacctt 


cctaccagtg 


cggctcaatg 


tgctgtgcct 




gactaagatg 


1080 









tga 
1083 

50 

<210> 31 
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<211> 419 
<212> DNA 
<213> Homo sapiens 

5 

<400> 31 

ttaatcctga ctgggagaag aaagtaattg agtattttaa ggaaaagctg 
aaggaaaata 60 
atgctcctaa gtgggtacca tcactgaacg aagttcccct tcattatttg 
10 aaacctaata 120 

gttttgtgaa atttcgttgc atgattcagg atatgtttga ccctgagttt 
tacatgggag 180 

tttatgaaac ggttaaccaa aacacaaaag cacatgttct tcattttgga 
aaatatagag 240 
15 atgtagcaga gtgtgggcct caacaagaac ttgatttaaa ctctccacga 
aataccactt 300 

tggaaagaca gactttctat tgtgttccgg tgcctgggga atctacgtgg 
gtaaaagaag 360 
cctatgttaa tgcaaaccaa gctcgagtca gtccctcaac atctacactc 
20 ctagtcgcc 419 



<210> 32 
25 <211> 406 

<212> DNA 
<213> Homo sapiens 



30 <400> 32 

tttcgtgcgc catgtgctcc gggctcctgt agctcttgct gcccatctgg 
ctctcctgga 60 

ccctggggac ccgaggctct gagcctcgca gtgtgaacga tcccgggaac 
atgtcctttg 120 
35 tgaaggagac ggtggacaag ctgctgacag gcttccgctg cttcagagag 
cgggaggccg 180 

cgcctaggcg ggctctcaga ggagccgctc taccgggaga gtcggaggct 
ggcgacccag 240 

agtcacttag gtcttctgtg aatgcagact ggattcaata ttctgacctg 
40 tgggaagcgg 300 

aggtcagtac cccgaggtgc gaagcgggct tttgccagga gtgctttagg 
acgccaggga 360 

atcaggagaa ggatggccct ttcatttgtt aattgagctg aaacgg 
406 



<210> 33 
<211> 476 
50 <212> DNA 

<213> Homo sapiens 
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<400> 33 

ccaagataga tcaactctcc ctaaaggctg acagtgaact cttggggccg 
ttttattctc 60 
5 tgaggttagc aaggagtcat ctactagcca ttcaggaggc cagctgggaa 
gacaaaatag 120 

gcaccccaaa ctcagcaact tcataacacc ttcctctccc cgcctgaagc 
cttaaactgc 180 

atcaagtcaa agaaacctgg ggcaaatcct taacatgttt ttgactgcag 
10 taaatccaca 240 

gccactctct actccgagct ggcagattga gaccaagtat tcaacgaaag 
tgctcactgg 300 

aaattggatg gaagagagga gaaaggggct accttacaaa cacctgatca 
cccaccacca 360 
15 ggagccccca catcgctacc tgatcagcac ctatgacgac cattacaacc 
ggcatggtta 420 

caacccgggg ctgcctccac tccgcacttg gaatggacag aagttgctgt ggctgc 
476 

20 

<210> 34 
<211> 874 
<212> PRT 
25 <213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (874) 
30 <223>.Xaa = any amino acid or symbol as shown in the 

table 8 as set forth in Example 2 



<400> 34 

35 He Asn Glu Cys Val Leu Ser Pro Cys Arg His Gly Ala Ser Cys 
Gin 

15 10 15 

Asn Thr His Gly Gly Tyr Arg Cys His Cys Gin Ala Gly Tyr Ser 
Gly 

40 20 25 30 

Arg Asn Cys Glu Thr Asp He Asp Asp Cys Arg Pro Asn Pro Cys 
His 

35 40 45 

Asn Gly Gly Ser Cys Thr Asp Gly He Asn Thr Ala Phe Cys Asp 
45 Cys 

50 55 60 

Leu Pro Gly Phe Arg Gly Thr Phe Cys Glu Glu Asp He Asn Glu 
Cys 

65 70 75 

50 80 

Ala Ser Asp Pro Cys Arg Asn Gly Ala Asn Cys Thr Asp Cys Val 
Asp 

100 
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Ser Tyr Thr 
Glu 

5 Asn Asn Thr 
Thr 

115 

Cys Val Asp 
Phe 

10 130 

Thr Gly Ser 
Pro 

145 
160 

15 Cys Leu Leu 
Cys 

Thr Cys Pro 
His 

20 

Trp Cys Asp 
Thr 

195 

His Thr Gin 
25 Tyr 

210 
Cys Asp Val 
Gly 
225 
30 240 

Val Asp Val 
Ala 

Gly Asn Thr 
35 Tyr 

Cys Glu Asp 
Gly 

275 

40 Ala Thr Cys 
Ala 

290 

Gly Tyr His 
Ser 
45 305 
320 

His Pro Cys 
Tyr 

50 Lys Cys Ser 
Asn 



85 

Cys Thr Cys Pro 
100 

Pro Asp Cys Thr 

Gly lie Asn Ser 
135 

Tyr Cys Gin His 
150 

Gly Gly Thr Cys 
165 

Gin Gly Tyr Thr 
180 

Ser Ser Pro Cys 

Tyr Arg Cys Glu 
215 

Pro Ser Val Ser 
230 

Ala Arg Leu Cys 
245 

His His Cys Arg 
260 

Leu Val Asp Glu 

Thr Asp Tyr Leu 
295 

Gly Val Asn Cys 
310 

Gin Asn Gly Gly 
325 

Cys Pro Arg Gly 
340 



90 

Ala Gly Phe Ser 
105 

Glu Ser Ser Cys 
120 

Phe Thr Cys Leu 

Val Val Asn Glu 
155 

Gin Asp Gly Arg 
170 

Gly Pro Asn Cys 
185 

Lys Asn Gly Gly 
200 

Cys Pro Ser Gly 

Cys Glu Val Ala 
235 

Gin His Gly Gly 
250 

Cys Gin Ala Gly 
265 

Cys Ser Pro Ser 
280 

Gly Gly Tyr Ser 

Ser Glu Glu lie 
315 

Thr Cys Leu Asp 
330 

Thr Gin Gly Val 
345 

101 
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95 

Gly lie His Cys 
110 

Phe Asn Gly Gly 
125 

Cys Pro Pro Gly 
140 

Cys Asp Ser Arg 



Gly Leu His Arg 
175 

Gin Asn Leu Val 
190 

Lys Cys Trp Gin 
205 

Trp Thr Gly Leu 
220 

Ala Gin Arg Gin 



Leu Cys Val Asp 
255 

Tyr Thr Gly Ser 
270 

Pro Cys Gin Asn 
285 

Cys Lys Cys Val 
300 

Asp Glu Cys Leu 



Leu Pro Asn Thr 
335 

His Cys Glu lie 
350 
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Val Asp Asp 
Lys 

355 

Cys Phe Asn 
5 Cys 

370 

Thr Cys Pro 
Asn 

385 
10 400 

Glu Cys Leu 
Val 

Gin Arg Val 
15 Gly 

Arg Arg Cys 
Lys 

435 

20 Asn Gly Gly 
He 

450 
Cys Lys Cys 
Ala 
25 465 
480 

Arg Thr Cys 
Ser 

30 Gly Pro Arg 
Pro 

Glu Cys Gin 
Cys 

35 515 
Tyr Asn Gin 
Arg 

530 

Cys Leu Cys 
40 Asp 
545 
560 

Tyr Ser Phe 
He 

45 

Glu Glu Ala 
Lys 

Val Cys Ser 
50 Gly 

595 



Cys Asn Pro Pro 

Asn Gly Thr Cys 
375 

Pro Gly Phe Val 
390 

Ser Asn Pro Cys 
405 

Asn Asp Phe His 
420 

Glu Ser Val He 

Thr Cys Ala Val 
455 

Pro Ala Gly Phe 
470 

Gly Ser Leu Arg 
485 

Ser Pro Thr Cys 
500 

Phe Pro Ala Ser 

Gly Thr Cys Glu 
535 

Pro Ala Lys Phe 
550 

Gly Gly Gly Ala 
565 

Cys Glu Leu Pro 
580 

Leu Gin Cys Asn 



Val Asp Pro Val 
360 

Val Asp Gin Val 

Gly Glu Arg Cys 
395 

Asp Ala Arg Gly 
410 

Cys Glu Cys Arg 
425 

Asn Gly Cys Lys 
440 

Ala Ser Asn Thr 

Glu Gly Ala Thr 
475 

Cys Leu Asn Gly 
490 

Leu Cys Leu Gly 
505 

Ser Pro Cys Leu 
520 

Pro Thr Ser Glu 

Asn Gly Leu Leu 
555 

Gly Arg Asp He 
570 

Glu Cys Gin Glu 
585 

Asn His Ala Cys 
600 
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Ser Arg Ser Pro 

365 

Gly Gly Tyr Ser 
380 

Glu Gly Asp Val 



Thr Gin Asn Cys 
415 

Ala Gly His Thr 
430 

Gly Lys Pro Cys 
445 

Ala Arg Gly Phe 
460 

Cys Glu Asn Asp 



Gly Thr Cys He 
495 

Pro Phe Thr Gly 
510 

Gly Gly Asn Pro 
525 

Ser Pro Phe Tyr 
540 

Cys His He Leu 



Pro Pro Pro Leu 
575 

Asp Ala Gly Asn 
590 

Gly Trp Asp Gly 
605 
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Asp Cys Ser 
Ser 

610 
Leu Gin Cys 
5 Cys 
625 
640 

Asn Ser Ala 
Glu 

10 

Gly Gin Cys 
Ser 

Asp Gly His 
15 Asp 

675 

Gly Leu Asp 
Thr 

690 

20 Arg Xaa Val 
Ser 

705 
720 
Phe His Phe 
25 Val 

Val Phe Lys 
Tyr 

30 Gly Arg Glu 
Glu 

755 

Gly Trp Ala 
Leu 

35 770 

Leu Pro Gly 
Pro 

785 
800 

40 Met Asp Val 
Gin 

Cys Val Gin 
Ala 

45 

Ala Phe Leu 
Tyr 

835 

Lys lie Glu 
50 Ala 

850 
Gin Leu His 



Leu Asn Phe Asn 
615 

Trp Lys Tyr Phe 
630 

Gly Cys Leu Phe 
645 

Asn Pro Leu Tyr 
660 

Cys Asp Gin Gly 

Cys Ala Glu His 
695 

Val Val Leu Met 
710 

Pro Ala Gly Val 
725 

Arg Asp Ala His 
740 

Glu Glu Leu Arg 

Ala Pro Asp Ala 

775 

Gly Ser Glu Gly 
790 

Arg Gly Ser lie 
805 

Ala Ser Ser Gin 
820 

Gly Ala Leu Ala 

Ala Val Gin Ser 
855 

Phe Met Tyr Val 



Asp Pro Trp Lys 

Ser Asp Gly His 
635 

Asp Gly Phe Asp 
650 

Asp Gin Tyr Cys 
665 

Cys Asn Ser Ala 
680 

Val Pro Glu Arg 

Pro Pro Glu Gin 
715 

Gin Ala Arg Val 
730 

Gly Gin Gin Met 
745 

Lys His Pro lie 
760 

Leu Leu Gly Gin 

Gly Arg Arg Arg 
795 

Val Tyr Leu Glu 
810 

Cys Phe Gin Ser 
825 

Ser Leu Gly Ser 
840 

Glu Thr Val Glu 

Ala Ala Ala 
103 
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Asn Cys Thr Gin 

620 

Cys Asp Ser Gin 



Cys Gin Arg Ala 
655 

Lys Asp His Phe 
670 

Glu Cys Glu Trp 
685 

Leu Ala Ala Gly 
700 

Leu Arg Asn Ser 



Leu His Thr Asn 
735 

lie Phe Pro Tyr 
750 

Lys Arg Ala Ala 
765 

Val Lys Ala Ser 
780 

Arg Glu Leu Asp 



lie Asp Asn Arg 
815 

Ala Thr Asp Val 
830 

Leu Asn lie Pro 
845 

Pro Pro Pro Pro 
860 



WO 02/077180 
865 
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5 <210> 35 

<211> 895 
<212> PRT 

<213> Homo sapiens 

10 

<400> 35 

Ser Ser Thr Leu Gly Thr Ala His Thr Pro Lys Val Val Thr Thr 
Met 

15 10 15 

15 Ala Thr Met Pro Thr Ala Thr Ala Ser Thr Val Pro Ser Ser Ser 
Thr 

20 25 30 

Val Gly Thr Thr Arg Thr Pro Ala Val Leu Pro Ser Ser Leu Pro 
Thr 

20 35 40 45 

Phe Ser Val Ser Thr Val Ser Ser Ser Val Leu Thr Thr Leu Arg 
Pro 

50 55 60 

Thr Gly Phe Pro Ser Ser His Phe Ser Thr Pro Cys Phe Cys Arg 
25 Ala 

65 70 75 

80 

Phe Gly Gin Phe Phe Ser Pro Gly Glu Val He Tyr Asn Lys Thr 
Asp 

30 85 90 95 

Arg Ala Gly Cys His Phe Tyr Ala Val Cys Asn Gin His Cys Asp 
He 

100 105 110 

Asp Arg Phe Gin Gly Ala Cys Pro Thr Ser Pro Pro Pro Val Ser 
35 Ser 

115 120 125 

Ala Pro Leu Ser Ser Pro Ser Pro Ala Pro Gly Cys Asp Asn Ala 
He 

130 135 140 

40 Pro Leu Arg Gin Val Asn Glu Thr Trp Thr Leu Glu Asn Cys Thr 
Val 

145 150 155 

160 

Ala Arg Cys Val Gly Asp Asn Arg Val Val Leu Leu Asp Pro Lys 
45 Pro 

165 170 175 

Val Ala Asn Val Thr Cys Val Asn Lys His Leu Pro He Lys Val 
Ser 

180 185 190 

50 Asp Pro Ser Gin Pro Cys Asp Phe His Tyr Glu Cys Glu Cys lie 
Cys 

195 200 205 

104 
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Ser Met Trp Gly Gly Ser His Tyr Ser Thr Phe Asp Gly Thr Ser 
Tyr 

210 215 220 

Thr Phe Arg Gly Asn Cys Thr Tyr Val Leu Met Arg Glu lie His 
5 Ala 

225 230 235 

240 

Arg Phe Gly Asn Leu Ser Leu Tyr Leu Asp Asn His Tyr Cys Thr 
Ala 

10 245 250 255 

Ser Ala Thr Ala Ala Ala Ala Arg Cys Pro Ala Ala Leu Ser lie 
His 

260 265 270 

Tyr Lys Ser Met Asp lie Val Leu Thr Val Thr Met Val His Gly 
15 Lys 

275 280 285 

Glu Glu Gly Leu lie Leu Phe Asp Pro lie Pro Val Ser Ser Gly 
Phe 

290 295 300 

20 Ser Lys Asn Gly Val Leu Val Ser Val Leu Gly Thr Thr Thr Met 
Ala 

305 310 315 

320 

Val Asp lie Pro Ala Leu Gly Val Ser Val Thr Phe Asn Gly Gin 
25 Val 

325 330 335 

Phe Gin Ala Arg Leu Pro Tyr Ser Leu Phe His Asn Asn Thr Glu 
Gly 

340 345 350 

30 Gin Cys Gly Thr Cys Thr Asn Asn Gin Arg Asp Asp Cys Leu Gin 
Arg 

355 360 365 

Asp Gly Thr Thr Ala Ala Ser Cys Lys Asp Met Ala Lys Thr Trp 
Leu 

35 370 375 380 

Val Pro Asp Ser Arg Lys Asp Gly Cys Trp Ala Pro Thr Gly Thr 
Pro 

385 390 395 

400 

40 Pro Thr Ala Ser Pro Ala Ala Pro Val Ser Ser Thr Pro Thr Pro 
Thr 

405 410 415 

Pro Cys Pro Pro Gin Leu Leu Cys Asp Leu Met Leu Ser Gin Val 
Phe 

45 420 425 430 

Ala Glu Cys His Thr Leu Leu Pro Pro Gly Pro Phe Phe Asn Ala 
Cys 

435 440 445 

lie Ser Asp His Cys Arg Gly Arg Leu Glu Val Pro Cys Gin Ser 
50 Leu 

450 455 460 
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Glu Ala Tyr Ala Glu Leu Cys Arg Ala Arg Gly Val Cys Ser Asp 
Trp 

465 470 475 

480 

5 Arg Gly Ala Thr Gly Gly Leu Cys Asp Leu Thr Cys Pro Pro Thr 
Lys 

485 490 495 

Val Tyr Lys Pro Cys Gly Pro He Gin Pro Ala Thr Cys Asn Ser 
Arg 

10 500 505 510 

Asn Gin Ser Pro Gin Leu Glu Gly Met Ala Glu Gly Cys Phe Cys 
Pro 

515 520 525 

Glu Asp Gin He Leu Phe Asn Ala His Met Gly He Cys Val Gin 
15 Ala 

530 535 540 

Cys Pro Cys Val Gly Pro Asp Gly Phe Pro Lys Phe Pro Gly Glu 
Arg 

545 550 555 

20 560 

Trp Val Ser Asn Cys Gin Ser Cys Val Cys Asp Glu Gly Ser Val 
Ser 

565 570 575 

Val Gin Cys Lys Pro Leu Pro Cys Asp Ala Gin Gly Gin Pro Pro 
25 Pro 

580 585 590 

Cys Asn Arg Pro Gly Phe Val Thr Val Thr Arg Pro Arg Ala Glu 
Asn 

595 600 605 

30 Pro Cys Cys Pro Glu Thr Val Cys Val Cys Asn Thr Thr Thr Cys 
Pro 

610 615 620 

Gin Ser Leu Pro Val Cys Pro Pro Gly Gin Glu Ser He Cys Thr 
Gin 

35 625 630 635 

640 

Glu Glu Gly Asp Cys Cys Pro Thr Phe Arg Cys Arg Pro Gin Leu 
Cys 

645 650 655 

40 Ser Tyr Asn Gly Thr Phe Tyr Gly Val Gly Ala Thr Phe Pro Gly 
Ala 

660 665 670 

Leu Pro Cys His Met Cys Thr Cys Leu Ser Gly Asp Thr Gin Asp 
Pro 

45 675 680 685 

Thr Val Gin Cys Gin Glu Asp Ala Cys Asn Asn Thr Thr Cys Pro 
Gin 

690 695 700 

Gly Phe Glu Tyr Lys Arg Val Ala Gly Gin Cys Cys Gly Glu Cys 
50 Val 

705 710 715 

720 
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Gin Thr Ala Cys Leu Thr Pro Asp Gly Gin Pro Val Gin Leu Asn 
Glu 

725 730 735 

Thr Trp Val Asn Ser His Val Asp Asn Cys Thr Val Tyr Leu Cys 
5 Glu 

740 745 750 

Ala Glu Gly Gly Val His Leu Leu Thr Pro Gin Pro Ala Ser Cys 
Pro 

755 760 765 

10 Asp Val Ser Ser Cys Arg Gly Ser Leu Arg Lys Thr Gly Cys Cys 
Tyr 

770 775 780 

Ser Cys Glu Glu Asp Ser Cys Gin Val Arg lie Asn Thr Thr lie 
Leu 

15 785 790 795 

800 

Trp His Gin Gly Cys Glu Thr Glu Val Asn lie Thr Phe Cys Glu 
Gly 

805 810 815 

20 Ser Cys Pro Gly Ala Ser Lys Tyr Ser Ala Glu Ala Gin Ala Met 
Gin 

820 825 830 

His Gin Cys Thr Cys Cys Gin Glu Arg Arg Val His Glu Glu Thr 
Val 

25 835 840 845 

Pro Leu His Cys Pro Asn Gly Ser Ala lie Leu His Thr Tyr Thr 
His 

850 855 860 

Val Asp Glu Cys Gly Cys Thr Pro Phe Cys Val Pro Ala Pro Met 
30 Ala 

865 870 875 

880 

Pro Pro His Thr Arg Gly Phe Pro Ala Gin Glu Ala Thr Ala Val 
885 890 895 

35 



<210> 36 
<211> 196 
40 <212> PRT 

<213> Homo sapiens 



<400> 36 

45 Ala Pro Val Thr Thr Ser Pro Arg Val Gly Gin Pro Trp Arg Thr 
Ala 

15 10 15 

Leu Ala Leu Arg Ser Leu Tyr Arg Ala Arg Pro Ser Leu Arg Cys 
Pro 

50 20 25 30 

Pro Val Glu Leu Pro Trp Ala Pro Arg Arg Gly His Arg Leu Ser 
Pro 
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35 40 45 

Ala Asp Asp Glu Leu Tyr Gin Arg Thr Arg lie Ser Leu Leu Gin 
Arg 

50 55 60 

5 Glu Ala Ala Gin Ala Met Tyr He Asp Ser Tyr Asn Ser Arg Gly 
Phe 

65 70 75 

80 

Met He Asn Gly Asn Arg Val Leu Gly Pro Cys Ala Leu Leu Pro 
10 His 

85 90 95 

Ser Val Val Gin Trp Asn Val Gly Ser His Gin Asp He Thr Glu 
Asp 

100 105 110 

15 Ser Phe Ser Leu Phe Trp Leu Leu Glu Pro Arg He Glu He Val 
Val 

115 120 125 

Val Gly Thr Gly Asp Arg Thr Glu Arg Leu Gin Ser Gin Val Leu 
Gin 

20 130 135 140 

Ala Met Arg Gin Arg Gly He Ala Val Glu Val Gin Asp Thr Pro 
Asn 

145 150 155 

160 

25 Ala Cys Ala Thr Phe Asn Phe Leu Cys His Glu Gly Arg Val Thr 
Gly 

165 170 175 

Ala Ala Leu lie Pro Pro Pro Gly Gly Thr Ser Leu Thr Ser Leu 
Gly 

30 180 185 190 

Gin Ala Ala Gin 
195 



35 

<210> 37 
<211> 113 
<212> PRT 

<213> Homo sapiens 

40 



<400> 37 

Ser His Ala Trp Ala His Ala Lys Arg Arg Leu Val Ser Val Tyr 
Arg 

45 1 5 10 15 

Thr Cys Leu Gly Trp Gin Phe Pro Trp Gly Arg Glu Glu Ala Trp 
Trp 

20 25 30 

Glu Leu Gly Pro Ala Trp Gin Thr Lys Pro Val Gly Thr Gin Ala 
50 Cys 

35 40 45 
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Pro Pro Pro Trp Pro Gly Val 
Pro 

50 55 
Asn Ala Pro Leu Ser lie Cys 
5 Asn 

65 70 

80 

Thr Tyr Ser Phe Val His Ala 
Val 

10 85 

Leu Leu Pro Asp Glu Gly Met 
Pro 

100 

Val 

15 
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Glu Lys Arg Val Cys Lys Ala Lys 

60 

Pro Pro Asp Val Leu Arg Phe Asp 
75 

Lys Lys Val Ser Phe Thr Val Glu 

90 95 
Gin Lys Tyr Asp Lys Glu Leu Thr 

105 110 



<210> 38 

20 <211> 126 

<212> PRT 

<213> Homo sapiens 



25 <400> 38 

Gly Thr Ser Asp Glu Gly Asp Pro Ser Glu Lys Leu Glu Leu Val 
Thr 

15 10 15 

Gly Thr Asn Val Tyr lie Thr Arg Ala Gin Leu Met Asn Cys His 
30 Val 

20 25 30 

Ser Ala Gly Thr Arg His Lys Val Leu Leu Arg Arg Leu Leu Ala 
Ser 

35 40 45 

35 Phe Phe Asp Arg Asn Thr Leu Ala Asn Ser Cys Gly Thr Gly lie 
Arg 

50 55 60 

Ser Ser Thr Asn Asp Pro Arg Arg Lys Pro Leu Asp Ser Arg Val 
Leu 

40 65 70 75 

80 

His Ala Val Lys Tyr Tyr Cys Gin Asn Phe Ala Pro Asn Phe Lys 
Glu 

85 90 95 

45 Ser Glu Met Asn Ala lie Ala Ala Asp Met Cys Thr Asn Ala Arg 
Arg 

100 105 110 

Val Val Arg Lys Ser Trp Met Pro Lys Val Lys Val Leu Lys 
115 120 125 

50 
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<210> 39 
<211> 187 
<212> PRT 

<213> Homo sapiens 

5 

<400> 39 

Gin Phe Phe Ser Leu Phe Leu Arg Tyr Gin He His Thr Gly Leu 
Gin 

10 1 5 10 15 

His Ser He He Arg Pro Thr Gin Pro Asn Cys Leu Pro Leu Asp 
Asn 

20 25 30 

Ala Thr Leu Pro Gin Lys Leu Lys Glu Val Gly Tyr Ser Thr His 
15 Met 

35 40 45 

Val Gly Lys Trp His Leu Gly Phe Tyr Arg Lys Glu Cys Met Pro 
Thr 

50 55 60 

20 Arg Arg Gly Phe Asp Thr Phe Phe Gly Ser Leu Leu Gly Ser Gly 
Asp 

65 70 75 

80 

Tyr Tyr Thr His Tyr Lys Cys Asp Ser Pro Gly Met Cys Gly Tyr 
25 Asp 

85 90 95 

Leu Tyr Glu Asn Asp Asn Ala Ala Trp Asp Tyr Asp Asn Gly He 
Tyr 

100 105 110 

30 Ser Thr Gin Met Tyr Thr Gin Arg Val Gin Gin lie Leu Ala Ser 
His 

115 120 125 

Asn Pro Thr Lys Pro He Phe Leu Tyr He Ala Tyr Gin Ala Val 
His 

35 130 135 140 

Ser Pro Leu Gin Ala Pro Gly Arg Tyr Phe Glu His Tyr Arg Ser 
He 

145 150 155 

160 

40 He Asn lie Asn Arg Arg Arg Tyr Ala Ala Met Leu Ser Cys Leu 
Asp 

165 170 175 

Glu Ala He Asn Asn Val Thr Leu Ala Leu Lys 
180 185 

45 



<210> 40 
<211> 137 
50 <212> PRT 

<213> Homo 



sapiens 
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<400> 40 

Thr Asp Val Asn Leu Lys Asn Gly Ser Gly Lys Asp Ser Leu Met 
Leu 

5 1 5 10 15 

Ala Cys Tyr Ala Gly His Leu Asp Val Val Lys Tyr Leu Arg Arg 
His 

20 25 30 

Gly Ala Ser Trp Gin Ala Arg Asp Leu Gly Gly Cys Thr Ala Leu 
10 His 

35 40 45 

Trp Ala Ala Asp Gly Gly His Cys Ser Val He Glu Trp Met He 
Lys 

50 55 60 

15 Asp Gly Cys Glu Val Asp Val Val Asp Thr Gly Ser Gly Trp Thr 
Pro 

65 70 75 

80 

Leu Met Arg Val Ser Ala Val Ser Gly Asn Gin Arg Val Ala Ser 
20 Leu 

85 90 95 

Leu He Asp Ala Gly Ala Asn Val Asn Val Lys Asp Arg Asn Gly 
Lys 

100 105 110 

25 Thr Pro Leu Met Ser Val Val Ser Leu Leu Glu Glu Arg Lys Lys 
Lys 

115 120 125 

Gin Arg Pro Lys Lys Ser Cys Val Cys 
130 135 

30 



<210> 41 
<211> 360 
35 <212> PRT 

<213> Homo sapiens 



<400> 41 

40 He Pro Gly Ser Thr Ala Ala Arg Arg Arg Gin Lys Gly Lys Asp 
Gly 

15 10 15 

Ala Gly Ala Lys Glu Ala Asp Lys Gly Thr Ser Lys Ala Lys Ala 
Glu 

45 20 25 30 

Glu Asp Glu Glu Ala Glu Arg lie Leu Pro Gly He Leu Asn Asp 
Thr 

35 40 45 

Leu Arg Phe Gly Met Phe Leu Gin Asp Leu He Phe Met He Ser 
50 Glu 

50 55 60 



ill 



10 



45 
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Glu 


Gly Ala Val Ala Tyr Asp 


Pro Ala 


Ala 


Asn 


Glu 


Cys 


Tyr Cys 


Ala 


















65 




70 






75 








80 


















Ser 


Leu 


Ser Ser Gin Val Pro 


Lys Asn 


His 


Val 


Ser 


Leu 


Val Thr 


Lys 




85 




90 








95 


Glu 


Asn 


Gin Val Phe Val Ala Gly Gly Leu Phe Tyr Asn Glu Asp 


Asn 




100 


105 










110 


Lys 


Glu 


Asp Pro Met Ser Ala 


Tyr Phe 


Leu 


Gin 


Phe 


Asp 


His Leu 


Asp 




115 


120 








125 




Ser 


Glu 


Trp Leu Gly Met Pro 


Pro Leu 


Pro 


Ser 


Pro Arg 


Cys Leu 


Phe 


130 


135 








140 






Gly 


Leu Gly Glu Ala Leu Asn 


Ser He 


Tyr 


Val 


Val 


Gly Gly Arg 


Glu 


















145 




150 






155 








160 


















He 


Lys 


Asp Gly Glu Arg Cys 


Leu Asp 


Ser 


Val 


Met 


Cys 


Tyr Asp 


Arg 




165 




170 








175 


Leu 


Ser 


Phe Lys Trp Gly Glu 


Ser Asp 


Pro 


Leu 


Pro 


Tyr 


Val Val 


Tyr 




180 


185 










190 


Gly 


His 


Thr Val Leu Ser His 


Met Asp Leu 


Val 


Tyr 


Val 


He Gly 


Gly 




195 


200 








205 




Lys 


Gly 


Ser Asp Arg Lys Cys 


Leu Asn 


Lys 


Met 


Cys 


Val 


Tyr Asp 


Pro 


210 


215 








220 






Lys 


Lys 


Phe Glu Trp Lys Glu 


Leu Ala 


Pro 


Met 


Gin 


Thr 


Ala Arg 


Ser 


















225 




230 






235 








240 


















Leu 


Phe 


Gly Ala Thr Val His Asp Gly Arg 


He 


He 


Val 


Ala Ala 


Gly 




245 




250 








255 


Val 


Thr Asp Thr Gly Leu Thr Ser Ser Ala Glu Val Tyr 


Ser lie 


Thr 




260 


265 










270 


Asp 


Asn 


Lys Trp Ala Pro Phe 


Glu Ala 


Phe 


Pro Gin Glu Arg Ser 


Ser 




275 


280 








285 




Leu 


Ser 


Leu Val Ser Leu Val 


Gly Thr 


Leu 


Tyr 


Ala 


He 


Gly Gly 


Phe 


290 


295 








300 






Ala 


Thr 


Leu Glu Thr Glu Ser Gly Glu Leu 


Val 


Pro 


Thr 


Glu Leu 


Asn 


















305 




310 






315 








320 
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Asp He Trp Arg Tyr Asn Glu Glu Glu Lys Lys Trp Glu Gly Val 
Leu 

325 330 335 

Arg Glu He Ala Tyr Ala Ala Gly Ala Thr Phe Leu Pro Val Arg 
5 Leu 

340 345 350 

Asn Val Leu Cys Leu Thr Lys Met 
355 360 

10 

<210> 42 
<211> 139 
<212> PRT 
15 <213> Homo sapiens 



<400> 42 

Asn Pro Asp Trp Glu Lys Lys Val He Glu Tyr Phe Lys Glu Lys 
20 Leu 

15 10 15 

Lys Glu Asn Asn Ala Pro Lys Trp Val Pro Ser Leu Asn Glu Val 
Pro 

20 25 30 

25 Leu His Tyr Leu Lys Pro Asn Ser Phe Val Lys Phe Arg Cys Met 
He 

35 40 45 

Gin Asp Met Phe Asp Pro Glu Phe Tyr Met Gly Val Tyr Glu Thr 
Val 

30 50 55 60 

Asn Gin Asn Thr Lys Ala His Val Leu His Phe Gly Lys Tyr Arg 
Asp 

65 70 75 

80 

35 Val Ala Glu Cys Gly Pro Gin Gin Glu Leu Asp Leu Asn Ser Pro 
Arg 

85 90 95 

Asn Thr Thr Leu Glu Arg Gin Thr Phe Tyr Cys Val Pro Val Pro 
Gly 

40 100 105 110 

Glu Ser Thr Trp Val Lys Glu Ala Tyr Val Asn Ala Asn Gin Ala 
Arg 

115 120 125 

Val Ser Pro Ser Thr Ser Thr Leu Leu Val Ala 
45 130 135 



<210> 43 
50 <211> 129 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 
<222> (1) . . . (129) 
5 <223> Xaa = any amino acid or symbol as shown in the 

table 8 as set forth in Example 2 



10 Ser 
Trp 

1 

Leu 
Asn 

15 

Asp 
Leu 

Thr 
20 Ala 

Leu 
Glu 
65 

25 80 

Ser 
Leu 

Trp 
30 Gin 

Glu 
He 

35 Cys 



<400> 43 
Cys Ala Met Cys 



Ser Gly Leu Leu Xaa Leu Leu Leu Pro He 



5 10 15 

Ser Trp Thr Leu Gly Thr Arg Gly Ser Glu Pro Arg Ser Val 



25 30 

Ser Phe Val Lys Glu Thr Val Asp Lys Leu 

40 45 

Phe Arg Glu Arg Glu Ala Ala Pro Arg Arg 



20 

Pro Gly Asn Met 
35 

Gly Phe Arg Cys 

50 55 60 

Arg Gly Ala Ala Leu Pro Gly Glu Ser Glu Ala Gly Asp Pro 

70 75 

Leu Arg Ser Ser Val Asn Ala Asp Trp He Gin Tyr Ser Asp 



85 

Glu Ala Glu Val 
100 

Cys Phe Arg Thr 
115 



90 95 
Ser Thr Pro Arg Cys Glu Ala Gly Phe Cys 

105 110 
Pro Gly Asn Gin Glu Lys Asp Gly Pro Phe 



120 



125 



40 <210> 44 

<211> 146 
<212> PRT 

<213> Homo sapiens 

45 <220> 

<221> misc_feature 
<222> (1) . . . (146) 

<223> Xaa = any amino acid or symbol as shown in the 
table 8 as set forth in Example 2 

50 

<400> 44 
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Thr Leu Gly Ala Val Leu Phe Ser Glu Val Ser Lys Glu Ser Ser 
Thr 

15 10 15 

Ser His Ser Gly Gly Gin Leu Gly Arg Gin Asn Arg His Pro Lys 
5 Leu 

20 25 30 

Ser Asn Phe He Thr Pro Ser Ser Pro Arg Leu Lys Pro Xaa Thr 
Ala 

35 40 45 

10 Ser Ser Gin Arg Asn Leu Gly Gin He Leu Asn Met Phe Leu Thr 
Ala 

50 55 60 

Val Asn Pro Gin Pro Leu Ser Thr Pro Ser Trp Gin He Glu Thr 
Lys 

15 65 70 75 

80 

Tyr Ser Thr Lys Val Leu Thr Gly Asn Trp Met Glu Glu Arg Arg 
Lys 

85 90 95 

20 Gly Leu Pro Tyr Lys His Leu He Thr His His Gin Glu Pro Pro 

• His 

100 105 110 

Arg Tyr Leu He Ser Thr Tyr Asp Asp His Tyr Asn Arg His Gly 
Tyr 

25 115 120 125 

Asn Pro Gly Leu Pro Pro Leu Arg Thr Trp Asn Gly Gin Lys Leu 
Leu 

130 135 140 

Trp Leu 
30 145 



115 



